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TRANSPORTERS AND ION CHANNELS 

This invention relates to nucleic acid and amino acid sequences of transporters and ion 
channels and to the use of these sequences in the diagnosis, treatment, and prevention of transport, 
5 neurological, muscle, immunological and cell proliferative disorders, and in the assessment of the 
effects of exogenous compounds on the expression of nucleic acid and amino acid sequences of 
transporters and ion channels. 

BACKGROUND OF THE INVENTION 

10 Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by 

hydrophobic lipid bilayer membranes which are highly impermeable to most polar molecules. Cells 
and organelles require transport proteins to import and export essential nutrients and metal ions . ✓ 
including K + , NH 4 \ Pj, S0 4 2 \ sugars, and vitamins, as well as various metabolic waste products. 
Transport proteins also play roles in antibiotic resistance, toxin secretion, ion balance, synaptic 

15 neurotransmission, kidney function, intestinal absorption, tumor growth, and other diverse cell 
functions (Griffith, J. and C. Sansom (1998) The Transporter Facts Book , Academic Press, San 
Diego CA, pp. 3-29). Transport can occur by a passive concentration-dependent mechanism, or can 
be linked to an energy source such as ATP hydrolysis or an ion gradient. Proteins that function in 
transport include carrier proteins, which bind to a specific solute and undergo a conformational 

20 change that translocates the bound solute across the membrane, and channel proteins, which form 
hydrophilic pores that allow specific solutes to diffuse through the membrane down an 
electrochemical solute gradient. 

Carrier proteins which transport a single solute from one side of the membrane to the other 
are called uniporters. In contrast, coupled transporters link the transfer of one solute with 

25 simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in 
the opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety 
of symporter systems driven by the sodium gradient that exists across the plasma membrane. 
Sodium moves into the cell down its electrochemical gradient and brings the solute into the cell 
with it. The sodium gradient that provides the driving force for solute uptake is maintained by the 

30 ubiquitous Na7K + ATPase system. Sodium-coupled transporters include the mammalian glucose 
transporter (SGLT1), iodide transporter (NIS), and multivitamin transporter (SMVT). All three 
transporters have twelve putative transmembrane segments, extracellular glycosylation sites, and 
cytoplasmically-oriented N- and C-termini. NIS plays a crucial role in the evaluation, diagnosis, 
and treatment of various thyroid pathologies because it is the molecular basis for radioiodide 

35 thyroid-imaging techniques and for specific targeting of radioisotopes to the thyroid gland (Levy, 

1 
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O. et al. (1997) Proc. Natl. Acad. Sci. USA 94:5568-5573). SMVT is expressed in the intestinal 
mucosa, kidney, and placenta, and is implicated in the transport of the water-soluble vitamins, e.g., 
biotin and pantothenate (Prasad, P.D. et al. (1998) J. Biol. Chem. 273:7501-7506). 

One of the largest families of transporters is the major facilitator superfamily (MFS), also 
5 called the uniporter-symporter-antiporter family. MFS transporters are single polypeptide carriers 
that transport small solutes in response to ion gradients. Members of the MFS are found in all 
classes of living organisms, and include transporters for sugars, oligosaccharides, phosphates, 
nitrates, nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaryotes all have 
a structure comprising 12 transmembrane segments (Pao, S.S. et al. (1998) Microbiol. Molec. Biol. 
10 Rev. 62: 1 -34). The largest family of MFS transporters is the sugar transporter family, which 

includes the seven glucose transporters (GLUT1 -GLUT7) found in humans that are required for the 
transport of glucose and other hexose sugars. These glucose transport proteins have unique tissue ' 
distributions and physiological functions. GLUT1 provides many cell types with their basal glucose 
requirements and transports glucose across epithelial and endothelial barrier tissues; GLUT2 
15 facilitates glucose uptake or efflux from the liver; GLUT3 regulates glucose supply to neurons; 
GLUT4 is responsible for insulin-regulated glucose disposal; and GLUT5 regulates fructose uptake 
into skeletal muscle. Defects in glucose transporters are involved in a recently identified 
neurological syndrome causing infantile seizures and developmental delay, as well as glycogen 
storage disease, Fanconi-Bickel syndrome, and non-insulin-dependent diabetes mellitus (Mueckler, 
20 M. (1994) Eur. J. Biochem. 219:713-725; Longo, N. and LJ. Elsas (1998) Adv. Pediatr. 45:293- 
313). 

Monocarboxylate anion transporters are proton-coupled symporters with a broad substrate 
specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and 
beta-hydroxybutyrate. At least seven isofonns have been identified to date. The isoforms are 

25 predicted to have twelve transmembrane (TM) helical domains with a large intracellular loop 
between TM6 and TM7, and play a critical role in maintaining intracellular pH by removing the 
protons that are produced stoichiometrically with lactate during glycolysis. The best characterized 
H + -monocarboxylate transporter is that of the erythrocyte membrane, which transports L-lactate and 
a wide range of other aliphatic monocarboxylates. Other cells possess HMinked monocarboxylate 

30 transporters with differing substrate and inhibitor selectivities. In particular, cardiac muscle and 
tumor cells have transporters that differ in their values for certain substrates, including 
stereoselectivity for L- over D-lactate, and in their sensitivity to inhibitors. There are 
Na + -monocarboxylate cotransporters on the luminal surface of intestinal and kidney epithelia, which 
allow the uptake of lactate, pyruvate, and ketone bodies in these tissues. In addition, there are 

35 specific and selective transporters for organic cations and organic anions in organs including the 
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kidney, intestine and liver. Organic anion transporters are selective for hydrophobic, charged 
molecules with electron-attracting side groups. Organic cation transporters, such as the ammonium 
transporter, mediate the secretion of a variety of drugs and endogenous metabolites, and contribute 
to the maintenance of intercellular pH (Poole, R.C. and A.P. Halestrap (1993) Am. J. Physiol. 
5 264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 329:321-328; and Martinelle, K. and I. 
Haggstrom (1993) J. Biotechnol. 30:339-350). 

ATP-binding cassette (ABC) transporters are members of a superfamily of membrane 
proteins that transport substances ranging from small molecules such as ions, sugars, amino acids, 
peptides, and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC 

10 transporters consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze 
ATP to supply the energy required for transport, and two membrane-spanning domains (MSD), each 
containing six putative transmembrane segments. These four modules may be encoded by a single ' 
gene, as is the case for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. 
When encoded by separate genes, each gene product contains a single NBD and MSD. These "half- 

15 molecules" form homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based 
major histocompatibility (MHC) peptide transport system Several genetic diseases are attributed to 
defects in ABC transporters, such as the following diseases and their corresponding proteins: cystic 
fibrosis (CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), 
Zellweger syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic 

20 hypoglycemia (sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) 
protein, another ABC transporter, in human cancer cells makes the cells resistant to a variety of 
cytotoxic drugs used in chemotherapy (Taglicht, D. and S. Michaelis (1998) Meth. Enzymol. 
292:130-162). 

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, 
25 selenium, nickel, and chromium are important as cofactors for a number of enzymes. For example, 
copper is involved in hemoglobin synthesis, connective tissue metabolism, and bone development, 
by acting as a cofactor in oxidoreductases such as superoxide dismutase, ferroxidase 
(ceruloplasmin), and lysyl oxidase. Copper and other metal ions must be provided in the diet, and 
are absorbed by transporters in the gastrointestinal tract. Plasma proteins transport the metal ions to 
30 the liver and other target organs, where specific transporters move the ions into cells and cellular 
organelles as needed. Imbalances in metal ion metabolism have been associated with a number of 
disease states (Danks, D.M. (1986) J. Med. Genet. 23:99-106). 

P-type ATPases comprise a class of cation-transporting transmembrane proteins. They are 
integral membrane proteins which use an aspartyl phosphate intermediate to move cations across a 
35 membrane. Features of P-type ATPases include: (i) a cation channel; (ii) a stalk, formed by 
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extensions of the transmembrane a-helices into the cytoplasm; (iii) an ATP binding domain; (iv) a 
phosphorylated aspartic acid; (v) an adjacent transduction domain; (vi) a phosphatase domain, 
which removes the phosphate from the aspartic acid as part of the reaction cycle; and (vii) six or 
more transmembrane domains. Included in this class are heavy metal-transporting ATPases as well 
5 as aminophospholipid transporters. 

The transport of phosphatidylserine and phosphatidylethanolamine by aminophospholipid 
translocase results in the movement of these molecules from one side of a bilayer to another. This 
transport is conducted by a newly identified subfamily of P-type ATPases which are proposed to be 
amphipath transporters. Amphipath transporters move molecules having both a hydrophilic and a 
10 hydrophobic region. As many as seventeen different genes belong to this P-type ATPases 

subfamily, being grouped into several distinct classes and subclasses (Halleck, M.S. et al., (1999) 
Physiol. Genomics 1:139-150; Vulpe.C. Et al., (1993) Nat. Genet. 3:7-13). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high 
capacity, low affinity process. However, under normal physiological conditions a significant 
15 fraction of fatty acid transport appears to occur via a high affinity, low capacity protein-mediated 
transport process. Fatty acid transport protein (FATP), an integral membrane protein with four 
transmembrane segments, is expressed in tissues exhibiting high levels of plasma membrane fatty 
acid flux, such as muscle, heart, and adipose. Expression of FATP is upregulated in 3T3-L1 cells 
during adipose conversion, and expression in COS7 fibroblasts elevates uptake of long-chain fatty 
20 acids (Hui.T.Y.etal. (1998) J. Biol. Chem. 273:27420-27429). 

The lipocalin superfamily constitutes a phylogenetically conserved group of more than forty 
proteins that function as extracellular hgand-binding proteins which bind and transport small 
hydrophobic molecules. Members of this family function as carriers of retinoids, odorants, 
chromophores, pheromones, allergens, and sterols, and in a variety of processes including nutrient 
25 transport, cell growth regulation, immune response, and prostaglandin synthesis. A subset of these 
proteins may be multifunctional, serving as either a biosynthetic enzyme or as a specific enzyme 
inhibitor. (Tanaka, T. et al. (1997) J. Biol. Chem. 272:15789-15795; and van't Hof, W. et al. (1997) 
J. Biol. Chem. 272:1837-1841.) 

Members of the lipocalin family display unusually low levels of overall sequence 
30 conservation. Pairwise sequence identity often falls below 20%. Sequence similarity between 
family members is limited to conserved cysteines which form disulfide bonds and three motifs 
which form a juxtaposed cluster that functions as a target cell recognition site. The lipocalins share 
an eight stranded, anti-parallel beta-sheet which folds back on itself to form a continuously 
hydrogen-bonded beta-barrel. The pocket formed by the barrel functions as an internal ligand 
35 binding site. Seven loops (LI to L7) form short beta-hairpins, except loop LI which is a large 
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omega loop that forms a lid to partially close the internal ligand-binding site (Flower (1996) 
Biochem. J. 318:1-14). 

Lipocalins are important transport molecules. Each lipocalin associates with a particular 
ligand and delivers that ligand to appropriate target sites within the organism. Retinol-binding 
5 protein (RBP), one of the best characterized lipocalins, transports retinol from stores within the liver 
to target tissues. Apolipoprotein D (apo D), a component of high density lipoproteins (HDLs) and 
low density lipoproteins (LDLs), functions in the targeted collection and delivery of cholesterol 
throughout the body. Lipocalins are also involved in cell regulatory processes. Apo D, which is 
identical to gross-cystic-disease-fluid protein (GCDFP)-24, is a progesterone/pregnenolone-binding 

10 protein expressed at high levels in breast cyst fluid. Secretion of apo D in certain human breast 
cancer cell lines is accompanied by reduced cell proliferation and progression of cells to a more 
differentiated phenotype. Similarly, apo D and another lipocalin, <x r acid glycoprotein (AGP), are ' 
involved in nerve cell regeneration. AGP is also involved in anti-inflammatory and 
immunosuppressive activities. AGP is one of the positive acute-phase proteins (APP); circulating 

15 levels of AGP increase in response to stress and inflammatory stimulation. AGP accumulates at 
sites of inflammation where it inhibits platelet and neutrophil activation and inhibits phagocytosis. 
The immunomodulatory properties of AGP are due to glycosylation. AGP is 40% carbohydrate, 
making it unusually acidic and soluble. The glycosylation pattern of AGP changes during acute- 
phase response, and deglycosylated AGP has no immunosuppressive activity (Flower (1994) FEBS 

20 Lett. 354:7-1 1; Flower (1996) supra) . 

The lipocalin superfamily also includes several animal allergens, including the mouse major 
urinary protein (mMUP), the rat a-2-microgloobulin (rA2U), the bovine P-lactoglobulin (pig), the 
cockroach allergen (Bla g4), bovine dander allergen (Bos d2), and the major horse allergen, 
designated Equus caballus allergen 1 (Equ cl). Equ cl is a powerful allergen responsible for about 

25 80% of anti-horse IgE antibody response in patients who are chronically exposed to horse allergens. 
It appears that lipocalins may contain a common structure that is able to induce the IgE response 
(Gregoire, C. et al., (1996) J. Biol. Chem. 271:32951-32959). 

Lipocalins are used as diagnostic and prognostic markers in a variety of disease states. The 
plasma level of AGP is monitored during pregnancy and in diagnosis and prognosis of conditions 

30 including cancer chemotherapy, renal disfunction, myocardial infarction, arthritis, and multiple 
sclerosis. RBP is used clinically as a marker of tubular reabsorption in the kidney, and apo D is a 
marker in gross cystic breast disease (Flower (1996) supra) . Additionally, the use of lipocalin 
animal allergens may help in the diagnosis of allergic reactions to horses (Gregoire supra) , pigs, 
cockroaches, mice and rats. 

35 Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions 
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and charged metabolites between the cytosol and the mitochondrial matrix. Examples include the 
ADP, ATP carrier protein; the 2-oxoglutarate/malate carrier; the phosphate carrier protein; the 
pyruvate carrier; the dicarboxylate carrier which transports malate, succinate, fumarate, and 
phosphate; the tricarboxylate carrier which transports citrate and malate; and the Grave's disease 
5 carrier protein, a protein recognized by IgG in patients with active Grave' s disease, an autoimmune 
disorder resulting in hyperthyroidism. Proteins in this family consist of three tandem repeats of an 
approximately 100 amino acid domain, each of which contains two transmembrane regions (Stryer, 
L. (1995) Biochemistry, W.H. Freeman and Company, New York NY, p. 551; PROSITE 
PDOC00189 Mitochondrial energy transfer proteins signature; Online Mendelian Inheritance in 
10 Man (OMDvI) *275000 Graves Disease). 

This class of transporters also includes the mitochondrial uncoupling proteins, which create 
proton teaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation' 
from ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling 
proteins have been implicated as modulators of thermoregulation and metabolic rate, and have been 
15 proposed as potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et al. 
(1999) J. Int. Med. 245:637-642). 
Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement 
of ions across the plasma membrane. The movement of ions requires ion channels, which form ion- 

20 selective pores within the membrane. There are two basic types of ion channels, ion transporters 
and gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively 
transport an ion against the ion's concentration gradient. Gated ion channels allow passive flow of 
an ion down the ion's electrochemical gradient under restricted conditions. Together, these types of 
ion channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical 

25 impulse conduction down the axon of a nerve cell, 2) transport of molecules into cells against 
concentration gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion. 
Ion Transporters 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the 
energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient. 

30 These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion 
transporters, including Na + -K + ATPase, Ca 2+ -ATPase, and H + -ATPase, are activated by a 
phosphorylation event. P-class ion transporters, also known as E1-E2 type ATPases, are responsible 
for maintaining resting potential distributions such that cytosolic concentrations of Na + and Ca 2+ are 
low and cytosolic concentration of K + is high. The vacuolar (V) class of ion transporters includes 

35 H + pumps on intracellular organelles, such as lysosomes and Golgi. V-class ion transporters are 
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responsible for generating the low pH within the lumen of these organelles that is required for 
function. The coupling factor (F) class consists of H + pumps in the mitochondria. F-class ion 
transporters utilize a proton gradient to generate ATP from ADP and inorganic phosphate (Pi). 

The P-ATPases are hexamers of a 100 kD subunit with ten transmembrane domains and 
5 several large cytoplasmic regions that may play a role in ion binding (Scarborough, G.A. (1999) 
Curr. Opin. Cell Biol. 11:517-522). The V-ATPases are composed of two functional domains: the 
V, domain, a peripheral complex responsible for ATP hydrolysis; and the V 0 domain, an integral 
complex responsible for proton translocation across the membrane. The F-ATPases are structurally 
and evolutionarily related to the V-ATPases. The F-ATPase F 0 domain contains 12 copies of the c 

10 subunit, a highly hydrophobic protein composed of two transmembrane domains and containing a 
single buried carboxyl group in TM2 that is essential for proton transport. The V-ATPase V 0 
domain contains three types of homologous c subunits with four or five transmembrane domains ' 
and the essential carboxyl group in TM4 or TM3. Both types of complex also contain a single a 
subunit that may be involved in regulating the pH dependence of activity (Forgac, M. (1999) J. 

15 Biol. Chem. 274:12951-12954). 

The resting potential of the cell is utilized in many processes involving carrier proteins and 
gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out 
of the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport 
(symport) so that the movement of Na + down an electrochemical gradient drives transport of the 

20 other molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca 2+ out of 
the cell with transport of Na + into the cell (antiport). 
Gated Ion Channels 

Gated ion channels control ion flow by regulating the opening and closing of pores. The 
ability to control ion flux through various gating mechanisms allows ion channels to mediate such 

25 diverse signaling and homeostatic functions as neuronal and endocrine signaling, muscle 

contraction, fertilization, and regulation of ion and pH balance. Gated ion channels are categorized 
according to the manner of regulating the gating function. Mechanically-gated channels open their 
pores in response to mechanical stress; voltage-gated channels (e.g., Na + , K + , Ca 2+ , and CI" channels) 
open their pores in response to changes in membrane potential; and ligand-gated channels (e.g., 

30 acetylcholine-, serotonin-, and glutamate-gated cation channels, and GAB A- and glycine-gated 
chloride channels) open their pores in the presence of a specific ion, nucleotide, or neurotransmitter. 
The gating properties of a particular ion channel (i.e., its threshold for and duration of opening and 
closing) are sometimes modulated by association with auxiliary channel proteins and/or post 
translational modifications, such as phosphorylation. 

35 Mechanically-gated or mechanosensitive ion channels act as transducers for the senses of 

7 
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touch, hearing, and balance, and also play important roles in cell volume regulation, smooth muscle 
contraction, and cardiac rhythm generation. A stretch-inactivated channel (SIC) was recently 
cloned from rat kidney. The SIC channel belongs to a group of channels which are activated by 
pressure or stress on the cell membrane and conduct both Ca 2+ and Na + (Suzuki, M. et al. (1999) J. 
5 Biol. Chem. 274:6330-6335). 

The pore-forming subunits of the voltage-gated cation channels form a superfamily of ion 
channel proteins. The characteristic domain of these channel proteins comprises six transmembrane 
domains (S1-S6), a pore-forming region (P) located between S5 and S6, and intracellular amino and 
carboxy termini. In the Na + and Ca 2+ subfamilies, this domain is repeated four times, while in the 

10 K + channel subfamily, each channel is formed from a tetramer of either identical or dissimilar 
subunits. The P region contains information specifying the ion selectivity for the channel. In the 
case of K+ channels, a GYG tripeptide is involved in this selectivity (Ishii, T.M. et al. (1997) Proc. ' 
Natl. Acad. Sci. USA 94:11651-11656). 

Voltage-gated Na + and K + channels are necessary for the function of electrically excitable 

15 cells, such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and 
muscle contraction, arise from large, transient changes in the permeability of the membrane to Na * 
and K + ions. Depolarization of the membrane beyond the threshold level opens voltage-gated Na + 
channels. Sodium ions flow into the cell, further depolarizing the membrane and opening more 
voltage-gated Na + channels, which propagates the depolarization down the length of the cell. 

20 Depolarization also opens voltage-gated potassium channels. Consequently, potassium ions flow 
outward, which leads to repolarization of the membrane. Voltage-gated channels utilize charged 
residues in the fourth transmembrane segment (S4) to sense voltage change. The open state lasts 
only about 1 millisecond, at which time the channel spontaneously converts into an inactive state 
that cannot be opened irrespective of the membrane potential. Inactivation is mediated by the 

25 channel's N-terminus, which acts as a plug that closes the pore. The transition from an inactive to a 
closed state requires a return to resting potential. 

Voltage-gated Na + channels are heterotrimeric complexes composed of a 260 kDa pore- 
forming a subunit that associates with two smaller auxiliary subunits, pi and 02. The 02 subunit is 
a integral membrane glycoprotein that contains an extracellular Ig domain, and its association with a 

30 and pi subunits correlates with increased functional expression of the channel, a change in its 
gating properties, as well as an increase in whole cell capacitance due to an increase in membrane 
surface area (Isom, L.L. et al. (1995) Cell 83:433-442). 

Non voltage-gated Na + channels include the members of the amiloride-sensitive Na + 
channel/degenerin (NaC/DEG) family. Channel subunits of this family are thought to consist of two 

35 transmembrane domains flanking a long extracellular loop, with the amino and carboxyl termini 
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located within the cell. The NaC/DEG family includes the epithelial Na + channel (ENaC) involved 
in Na + reabsorption in epithelia including the airway, distal colon, cortical collecting duct of the 
kidney, and exocrine duct glands. Mutations in ENaC result in pseudohypoaldosteronism type 1 
and Liddle's syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the 
5 recently characterized IT-gated cation channels or acid-sensing ion channels (ASIC). ASIC 
subunits are expressed in the brain and form heteromultimeric Na + -permeable channels. These 
channels require acid pH fluctuations for activation. ASIC subunits show homology to the 
degenerins, a family of mechanically-gated channels originally isolated from C. elegans. Mutations 
in the degenerins cause neurodegeneration. ASIC subunits may also have a role in neuronal 
10 function, or in pain perception, since tissue acidosis causes pain (Waldmann, R. and M. Lazdunski 
(1998) Out. Opin. Neurobiol. 8:418-424; Eglen, R.M. et al. (1999) Trends Pharmacol. Sci. 20:337- 
342), 

K + channels are located in all cell types, and may be regulated by voltage, ATP 
concentration, or second messengers such as Ca 2+ and cAMP. In non-excitable tissue, K + channels 

15 are. involved in protein synthesis, control of endocrine secretions, and the maintenance of osmotic 
equilibrium across membranes. In neurons and other excitable cells, in addition to regulating action 
potentials and repolarizing membranes, K + channels are responsible for setting the resting 
membrane potential. The cytosol contains non-diffusible anions and, to balance this net negative 
charge, the cell contains a Na + -K + pump and ion channels that provide the redistribution of Na + , K + , 

20 and Cl\ The pump actively transports Na + out of the cell and K + into the cell in a 3:2 ratio. Ion 
channels in the plasma membrane allow K + and CI " to flow by passive diffusion. Because of the 
high negative charge within the cytosol, CI " flows out of the cell. The flow of K + is balanced by an 
electromotive force pulling K + into the cell, and a K + concentration gradient pushing K + out of the 
cell. Thus, the resting membrane potential is primarily regulated by K + flow (Salkoff, L. and T. 

25 Jegla (1995) Neuron 15:489-492). 

Potassium channel subunits of the Shaker -like superfamily all have the characteristic six 
transmembrane/1 pore domain structure. Four subunits combine as homo- or heterotetramers to 
form functional K channels. These pore-forming subunits also associate with various cytoplasmic 0 
subunits that alter channel inactivation kinetics. The Shaker -like channel family includes the 

30 voltage-gated K + channels as well as the delayed rectifier type channels such as the human ether-a- 
go-go related gene (HERG) associated with long QT, a cardiac dysrythmia syndrome (Curran, M.E. 
(1998) Curr. Opin. Biotechnol. 9:565-572; Kaczorowski, G.J. and M.L. Garcia (1999) Curr. Opin. 
Chem. Biol. 3:448-458). 

A second superfamily of K + channels is composed of the inward rectifying channels (Kir). 

35 Kir channels have the property of preferentially conducting K + currents in the inward direction. 
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These proteins consist of a single potassium selective pore domain and two transmembrane 
domains, which correspond to the fifth and sixth transmembrane domains of voltage-gated K + 
channels. Kir subunits also associate as tetramers. The Kir family includes ROMK1, mutations in 
which lead to Bartter syndrome, a renal tubular disorder. Kir channels are also involved in 
5 regulation of cardiac pacemaker activity, seizures and epilepsy, and insulin regulation (Doupnik, 
CA. et al. (1995) Curr. Opin. Neurobiol. 5:268-277; Curran, supra) . 

The recently recognized TWIK K + channel family includes the mammalian TWIK-1, 
TREK-1 and TASK proteins. Members of this family possess an overall structure with four 
transmembrane domains and two P domains. These proteins are probably involved in controlling 

10 the resting potential in a large set of cell types (Duprat, F. et al. (1997) EMBO J 16:5464-5471). 

The voltage-gated Ca :# channels have been classified into several subtypes based upon their 
electrophysiological and pharmacological characteristics. L-type Ca 2+ channels are predominantly 
expressed in heart and skeletal muscle where they play an essential role in excitation-contraction 
coupling. T-typc channels arc important for cardiac pacemaker activity, while N-type and P/Q-type 

15 channels are involved in the control of neurotransmitter release in the central and peripheral nervous 
system. The L-typc and N-iypc voltage-gated Ca 2+ channels have been purified and, though their 
functions differ dramatically, they have similar subunit compositions. The channels are composed 
of three subunits. The a, subunit forms the membrane pore and voltage sensor, while the o^S and p 
subunits modulate the voltage-dependence, gating properties, and the current amplitude of the 

20 channel. These subunits are encoded by at least six a„ one c^S, and four p genes. A fourth subunit, 
y, has been identified in skeletal muscle (Walker, D. et al. (1998) J. Biol. Chem. 273:2361-2367; 
McCleskey, E.W. (1994) Curr. Opin. Neurobiol. 4:304-312). 

The high-voltage-activated Ca 2+ channels that have been characterized biochemically 
include complexes of a pore-forming alphal subunit of approximately 190-250 kDa; a 

25 transmembrane complex of alpha2 and delta subunits; an intracellular beta subunit; and in some 
cases a transmembrane gamma subunit. A variety of alphal subunits, alpha2delta complexes, beta 
subunits, and gamma subunits are known. The Cavl family of alphal subunits conduct L-type Ca 2+ 
currents, which initiate muscle contraction, endocrine secretion, and gene transcription, and are 
regulated primarily by second messenger-activated protein phosphorylation pathways. The Cav2 

30 family of alphal subunits conduct N-type, P/Q-type, and R-type Ca 2+ currents, which initiate rapid 
synaptic transmission and are regulated primarily by direct interaction with G proteins and SNARE 
proteins and secondarily by protein phosphorylation. The Cav3 family of alphal subunits conduct 
T-type Ca 2+ currents, which are activated and inactivated more rapidly and at more negative 
membrane potentials than other Ca 2 * current types. The distinct structures and patterns of 

35 regulation of these three families of Ca 2+ channels provide an anray of Ca 2+ entry pathways in 

10 



3NSOOCID: <WO 024641 5A2_I_> 



WO 02/46415 



PCT/US01/46963 



response to changes in membrane potential and a range of possibilities for regulation of Ca 2+ entry 
by second messenger pathways and interacting proteins (Catterall, W. A. (2000) Annu. Rev. Cell 
Dev. Biol. 16:521-555). 

The alpha-2 subunit of the voltage-gated Ca 2+ -channel may include one or more Cache 

5 domains. An extracellular Cache domain may be fused to an intracellular catalytic domain, such as 
the histidine kinase, PP2C phosphatase, GGDEF (a predicted diguanylate cyclase), HD-GYP (a 
predicted phosphodiesterase) or adenylyl cyclase domain, or to a noncatalytic domain, like the 
methyl-accepting, DNA-binding winged helix-turn-helix, GAF, PAS or HAMP (a domain found in 
istidinc kinases, denylyl cyclases, ethyl-binding proteins and phosphatases). Small molecules are 

10 bound via the Cache domain and this signal is converted into diverse outputs depending on the 
intracellular domains (Anantharaman, V. and Aravind, L.(2000) Trends Biochenx Sci. 25:535-537). 

The transient receptor family (Trp) of calcium ion channels are thought to mediate . ✓ 

capacitative calcium entry (CCE). CCE is the Ca 2+ influx into cells to resupply Ca 2+ stores depleted 
by the action of inositol triphosphate (IP3) and other agents in response to numerous hormones and 

15 growth factors. Trp and Trp-like were first cloned from Drosophila and have similarity to voltage 
gated Ca 2 * channels in the S3 through S6 regions. This suggests that Trp and/or related proteins 
may form mammalian CCE channels (Zhu, X. et al. (1996) Cell 85:661-671; Boulay, G. et al. (1997) 
J. Biol. Chem. 272:29672-29680). Melastatin is a gene isolated in both the mouse and human, 
whose expression in melanoma cells is inversely correlated with melanoma aggressiveness in vivo . 

20 The human cDNA transcript corresponds to a 1533-amino acid protein having homology to 
members of the Trp family. It has been proposed that the combined use of malastatin mRNA 
expression status and tumor thickness might allow for the determination of subgroups of patients at 
both low and high risk for developing metastatic disease (Duncan, L.M. et al (2001) J. Clin. Oncol. 
19:568-576). 

25 Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and 

organelle pH. In secretory epithelial cells, CI" enters the cell across a basolateral membrane through 
an Na + , K7C1" cotransporter, accumulating in the cell above its electrochemical equilibrium 
concentration. Secretion of CI " from the apical surface, in response to hormonal stimulation, leads 
to flow of Na + and water into the secretory lumen. The cystic fibrosis transmembrane conductance 

30 regulator (CFTR) is a chloride channel encoded by the gene for cystic fibrosis, a common fatal 
genetic disorder in humans. CFTR is a member of the ABC transporter family, and is composed of 
two domains each consisting of six transmembrane domains followed by a nucleotide-binding site. 
Loss of CFTR function decreases transepithelial water secretion and, as a result, the layers of mucus 
that coat the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear. 

35 The resulting blockage of these sites leads to pancreatic insufficiency, "meconium ileus", and 
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devastating "chronic obstructive pulmonary disease" (AI-Awqati, Q. et al. (1992) J. Exp. Biol. 
172:245-266). 

The voltage-gated chloride channels (CLC) are characterized by 10-12 transmembrane 
domains, as well as two small globular domains known as CBS domains. The CLC subunits 
5 probably function as homotetramers. CLC proteins are involved in regulation of cell volume, 
membrane potential stabilization, signal transduction, and transepithelial transport. Mutations in 
CLC-1, expressed predominantly in skeletal muscle, are responsible for autosomal recessive 
generalized myotonia and autosomal dominant myotonia congenita, while mutations in the kidney 
channel CLC-5 lead to kidney stones (Jentsch, TJ. (1996) Curr. Opin. Neurobiol. 6:303-310). 
10 Ligand-gated channels open their pores when an extracellular or intracellular mediator 

binds to the channel. Neurotransmitter-gated channels are channels that open when a 
neurotransmitter binds to their extracellular domain. These channels exist in the postsynaptic 
membrane of nerve or muscle cells. There are two types of neurotransmitter-gated channels. 
Sodium channels open in response to excitatory neurotransmitters, such as acetylcholine, glutamate, 
15 and serotonin. This opening causes an influx of Na + and produces the initial localized 

depolarization that activates the voltage-gated channels and starts the action potential. Chloride 
channels open in response to inhibitory neurotransmitters, such as y-aininobutyric acid (GABA) and 
glycine, leading to hyperpolarization of the membrane and the subsequent generation of an action 
potential. Neurotransmitter-gated ion channels have four transmembrane domains and probably 
20 function as pentamers (Jentsch, supra). Amino acids in the second transmembrane domain appear to 
be important in determining channel permeation and selectivity (Sather, W.A. et al. (1994) Curr. 
Opih. Neurobiol. 4:313-323). 

Ligand-gated channels can be regulated by intracellular second messengers. For example, 
calcium-activated K + channels are gated by internal calcium ions. In nerve cells, an influx of 
25 calcium during depolarization opens K + channels to modulate the magnitude of the action potential 
(Ishi et al., supra). The large conductance (BK) channel has been purified from brain and its subunit 
composition determined. The cc subunit of the BK channel has seven rather than six transmembrane 
domains in contrast to voltage-gated K+ channels. The extra transmembrane domain is located at 
the subunit N-terminus. A 28-amino-acid stretch in the C-terminal region of the subunit (the 
30 "calcium bowl" region) contains many negatively charged residues and is thought to be the region 
responsible for calcium binding. The P subunit consists of two transmembrane domains connected 
by a glycosylated extracellular loop, with intracellular N- and C-termini (Kaczorowski, supra : 
Vergara, C. et al. (1998) Curr. Opin. Neurobiol. 8:321-329). 

Cyclic nucleotide-gated (CNG) channels are gated by cytosolic cyclic nucleotides. The best 
35 examples of these are the c AMP-gated Na + channels involved in olfaction and the cGMP-gated 
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cation channels involved in vision. Both systems involve ligand-mediated activation of a G-protein 
coupled receptor which then alters the level of cyclic nucleotide within the cell. CNG channels also 
represent a major pathway for Ca 2+ entry into neurons, and play roles in neuronal development and 
plasticity. CNG channels are tetramers containing at least two types of subunits, an a subunit which 

5 can form functional homomeric channels, and a P subunit, which modulates the channel properties. 
All CNG subunits have six transmembrane domains and a pore forming region between the fifth and 
sixth transmembrane domains, similar to voltage-gated K + channels. A large C-terminal domain 
contains a cyclic nucleotide binding domain, while the N-terminal domain confers variation among 
channel subtypes (Zufall, F. et al. (1997) Curr. Opin. Neurobiol. 7:404-412). 

10 The activity of other types of ion channel proteins may also be modulated by a variety of 

intracellular signalling proteins. Many channels have sites for phosphorylation by one or more 
protein kinases including protein kinase A, protein kinase C, tyrosine kinase, and casein kinase II, ' 
all of which regulate iori channel activity in cells. Kir channels are activated by the binding of the 
GpY subunits of heterotrimeric G-proteins (Reimann, F. and F.M. Ashcroft (1999) Curr. Opin. Cell. 

15 Biol. 1 1 :503-508). Other proteins are involved in the localization of ion channels to specific sites in 
the cell membrane. Such proteins include the PDZ domain proteins known as MAGUKs 
(membrane-associated guanylate kinases) which regulate the clustering of ion channels at neuronal 
synapses (Craven, S.E. and D.S. Bredt (1998) Cell 93:495-498). 
Disease Correlation 

20 The etiology of numerous human diseases and disorders can be attributed to defects in the 

transport of molecules across membranes. Defects in the trafficking of membrane-bound 
transporters and ion channels are associated with several disorders, e.g., cystic fibrosis, 
glucose-galactose malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain 
forms of diabetes mellitus. Single-gene defect diseases resulting in an inability to transport small 

25 molecules across membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, and Fanconi 
disease (van't Hoff, W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. 
Med. 120:21 8-226; and Chillon, M. et al. (1995) New Engl J. Med. 332: 1475-1480). 

Human diseases caused by mutations in ion channel genes include disorders of skeletal 
muscle, cardiac muscle, and the central nervous system. Mutations in the pore-forming subunits of 

30 sodium and chloride channels cause myotonia, a muscle disorder in which relaxation after voluntary 
contraction is delayed. Sodium channel myotonias have been treated with channel blockers. 
Mutations in muscle sodium and calcium channels cause forms of periodic paralysis, while 
mutations in the sarcoplasmic calcium release channel, T-tubule calcium channel, and muscle 
sodium channel cause malignant hyperthermia. Cardiac arrythmia disorders such as the long QT 

35 syndromes and idiopathic ventricular fibrillation are caused by mutations in potassium and sodium 
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channels (Cooper, E.C. and L.Y. Jan (1998) Proc. Natl. Acad. Sci. USA 96:4759-4766). All four 
known human idiopathic epilepsy genes code for ion channel proteins (Berkovic, S.F. and LE. 
Scheffer (1999) Curr. Opin. Neurology 12:177-182). Other neurological disorders such as ataxias, 
hemiplegic migraine and hereditary deafness can also result from mutations in ion channel genes 
5 (Jen, J. (1999) Curr. Opin. Neurobiol. 9:274-280; Cooper, supra) . 

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels 
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia. 
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head 
trauma, and neurodegenerative disease (Taylor, CP. and L.S. Narasimhan (1997) Adv. Pharmacol. 

10 39:47-98). Various classes of ion channels also play an important role in the perception of pain, and 
thus are potential targets for new analgesics. These include the vanilloid-gated ion channels, which 
are activated by the vanilloid capsaicin, as well as by noxious heat. Local anesthetics such as . ✓ 
lidocaine and mexiletine which blockade voltage-gated Na + channels have beenuseful in the 
treatment of neuropathic pain (Eglen, supra\ Calcium-channel protein expression is altered in 

15 metastatic melanomas (Enklaar, T. et al. (2000) Genomics 67:179-187. 

Ion channels in the immune system have recently been suggested as targets for 
immunomodulation. T-cell activation depends upon calcium signaling, and a diverse set of T-cell 
specific ion channels has been characterized that affect this signaling process. Channel blocking 
agents can inhibit secretion of lymphokines, cell proliferation, and killing of target cells. A peptide 

20 antagonist of the T-cell potassium channel Kvl.3 was found to suppress delayed-type 

hypersensitivity and allogenic responses in pigs, validating the idea of channel blockers as safe and 
efficacious immunosuppressants (Cahalan, M.D. and K.G. Chandy (1997) Cuit. Opin. Biotechnol. 
8:749-756). 

The discovery of new transporters and ion channels, and the polynucleotides encoding 
25 them, satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of transport, neurological, muscle, immunological and cell proliferative 
disorders, and in the assessment of the effects of exogenous compounds on the expression of nucleic 
acid and amino acid sequences of transporters and ion channels. 

30 SUMMARY OF THE INVENTION 

The invention features purified polypeptides, transporters and ion channels, referred to 
collectively as "TRICH" and individually as 'TRICH-1," 'TRICH-2," 'TRICH-3," 'TRICH-4," 
"TRICH-5," 'TRICH-6," "TRICH-7," "TRICH-8," 'TRICH-9," 'TRICH-10," 'TRICH-1 1," 
"TRICH-12," 'TRICH-13," 'TRICH-14," 'TRICH-15," 'TRICH-16," "TRICH-17," 'TRICH-18," 
35 'TRICH-19," 'TRICH-20," "TRICH-21," 'TRICH-22," 'TRICH-23," 'TRICH-24," 'TRICH-25," 
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"TRICH-26," 'TRICH-27," "TRICH-28," 'TRICH-29," 'TRICH-30," "TRICH-31," and t4 TRICH- 
32." In one aspect, the invention provides an isolated polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 
5 sequence at least 90% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
In one alternative, the invention provides an isolated polypeptide comprising the amino acid 

10 sequence of SEQ ID NO: 1-32. 

The invention further provides an isolated polynucleotide encoding a polypeptide selected 
from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 
sequence at least 90% identical to an amino acid sequence selected from the group consisting of 

15 SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
In one alternative, the polynucleotide encodes a polypeptide selected from the group consisting of 
SEQ ID NO: 1-32. In another alternative, the polynucleotide is selected from the group consisting of 

20 SEQE)NO:33-64. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 

25 sequence at least 90% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
In one alternative, the invention provides a cell transformed with the recombinant polynucleotide. 

30 In another alternative, the invention provides a transgenic organism comprising the recombinant 
polynucleotide. 

The invention also provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 
35 sequence at least 90% identical to an amino acid sequence selected from the group consisting of 
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SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
The method comprises a) culturing a cell under conditions suitable for expression of the 
polypeptide, wherein said cell is transformed with a recombinant polynucleotide comprising a 
promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) recovering 
the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide . ✓ 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. 

The invention further provides an isolated polynucleotide selected from the group 
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:33-64, b) a polynucleotide comprising a naturally occurring 
polynucleotide sequence at least 90% identical to a polynucleotide sequence selected from the 
group consisting of SEQ ID NO:33-64, c) a polynucleotide complementary to the polynucleotide of 
a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA equivalent of a)- 
d). In one alternative, the polynucleotide comprises at least 60 contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a 
sample, said target polynucleotide having a sequence of a polynucleotide selected from the group 
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:33-64, b) a polynucleotide comprising a naturally occurring 
polynucleotide sequence at least 90% identical to a polynucleotide sequence selected from the 
group consisting of SEQ ID NO:33-64, c) a polynucleotide complementary to the polynucleotide of 
a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA equivalent. of a)- 
d). The method comprises a) hybridizing the sample with a probe comprising at least 20 contiguous 
nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and 
which probe specifically hybridizes to said target polynucleotide, under conditions whereby a 
hybridization complex is formed between said probe and said target polynucleotide or fragments 
thereof, and b) detecting the presence or absence of said hybridization complex, and optionally, if 
present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 
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nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide selected from the group consisting 
of a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of 
5 SEQ ID NO.33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at 
least 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain 

10 reaction amplification, and b) detecting the presence or absence of said amplified target 
polynucleotide or fragment thereof, and, optionally, if present, the amount thereof. 

The invention further provides a composition comprising an effective amount of a ■ ' 

polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a 

15 naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, and a pharmaceutically acceptable excipient. In one embodiment, 

20 the composition comprises an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32. The invention additionally provides a method of treating a disease or condition associated 
with decreased expression of functional TRICH, comprising administering to a patient in need of 
such treatment the composition. 

The invention also provides a method for screening a compound for effectiveness as an 

25 agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-32, b) a polypeptide comprising 
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 

30 immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. The method comprises a) exposing a sample comprising the 
polypeptide to a compound, and b) detecting agonist activity in the sample. In one alternative, the 
invention provides a composition comprising an agonist compound identified by the method and a 
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of 

35 treating a disease or condition associated with decreased expression of functional TRICH, 
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comprising administering to a patient in need of such treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness 
as an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising 
an amino acid sequence selected from the group consisting of SEQ ED NO: 1-32, b) a polypeptide 
5 comprising a naturally occurring amino acid sequence at least 90% identical to an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment 
of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32, and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32. The method comprises a) exposing a sample 

10 comprising the polypeptide to a compound, and b) detecting antagonist activity in the sample. In 
one alternative, the invention provides a composition comprising an antagonist compound identified 
by the method and a pharmaceutical^ acceptable excipient. In another alternative, the invention ' 
provides a method of treating a disease or condition associated with overexpression of functional 
TRICH, comprising administering to a patient in need of such treatment the composition. 

15 The invention further provides a method of screening for a compound that specifically 

binds to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising 
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 

20 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. The method comprises a) combining the polypeptide with at least 
one test compound under suitable conditions, and b) detecting binding of the polypeptide to the test 
compound, thereby identifying a compound that specifically binds to the polypeptide. 

25 Th e invention further provides a method of screening for a compound that modulates the 

activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising 
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 

30 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. The method comprises a) combining the polypeptide with at least 
one test compound under conditions permissive for the activity of the polypeptide, b) assessing the 
activity of the polypeptide in the presence of the test compound, and c) comparing the activity of 

35 the polypeptide in the presence of the test compound with the activity of the polypeptide in the 
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absence of the test compound, wherein a change in the activity of the polypeptide in the presence of 
the test compound is indicative of a compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
5 polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, the method 
comprising a) exposing a sample comprising the target polynucleotide to a compound, b) detecting 
altered expression of the target polynucleotide, and c) comparing the expression of the target 
polynucleotide in the presence of varying amounts of the compound and in the absence of the 
compound. 

10 The invention further provides a method for assessing toxicity of a test compound, said 

method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 2(V 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, ii) 

15 a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, iii) a 
polynucleotide having a sequence complementary to i), iv) a polynucleotide complementary to the 
polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions 
whereby a specific hybridization complex is formed between said probe and a target polynucleotide 

20 in the biological sample, said target polynucleotide selected from the group consisting of i) a 

polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33- 
64, iii) a polynucleotide complementary to the polynucleotide of i), iv) a polynucleotide 

25 complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Alternatively, the 
target polynucleotide comprises a fragment of a polynucleotide sequence selected from the group 
consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) comparing the 
amount of hybridization complex in the treated biological sample with the amount of hybridization 
complex in an untreated biological sample, wherein a difference in the amount of hybridization 

30 complex in the treated biological sample is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 
Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
sequences of the present invention. 
35 Table 2 shows the GenBank identification number and annotation of the nearest GenBank 
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homolog for polypeptides of the invention. The probability scores for the matches between each 
polypeptide and its homolog(s) are also shown. 

Table 3 shows structural features of polypeptide sequences of the invention, including 
predicted motifs and domains, along with the methods, algorithms, and searchable databases used 
5 for analysis of the polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
polynucleotide sequences of the invention, along with selected fragments of the polynucleotide 
sequences. 

Table 5 shows the representative cDNA library for polynucleotides of the invention. 
10 Table 6 provides an appendix which describes the tissues and vectors used for construction 

of the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and*' 
polypeptides of the invention, along with applicable descriptions, references, and threshold 
parameters. 

15 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is 
understood that this invention is not limited to the particular machines, materials and methods 
described, as these may vary. It is also to be understood that the terminology used herein is for the 

20 purpose of describing particular embodiments only, and is not intended to limit the scope of the 
present invention which will be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," 
"an," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for 
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an 

25 antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled in 
the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although any machines, materials, and methods similar or equivalent to those described 

30 herein can be used to practice or test the present invention, the preferred machines, materials and 
methods are now described. All publications mentioned herein are cited for the purpose of 
describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the 
publications and which might be used in connection with the invention. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate such disclosure by virtue of 

35 prior invention. 
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DEFINITIONS 

"TRICH" refers to the amino acid sequences of substantially purified TRICH obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, 
and human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 
5 The term "agonist" refers to a molecule which intensifies or mimics the biological activity 

of TRICH. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any 
other compound or composition which modulates the activity of TRICH either by directly 
interacting with TRICH or by acting on components of the biological pathway in which TRICH 
participates. 

10 An "allelic variant" is an alternative form of the gene encoding TRICH. Allelic variants 

may result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs 
or in polypeptides u hose structure or function may or may not be altered. A gene may have none,. ✓ 
one, or many allelic variants of its naturally occurring form. Common mutational changes which 
give rise to allelic variants arc generally ascribed to natural deletions, additions, or substitutions of 

15 nucleotides. Each of these t> pes of changes may occur alone, or in combination with the others, one 
or more times in a given sequence. 

"Altered" nucleic acid sequences encoding TRICH include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TRICH or 
a polypeptide with at least one functional characteristic of TRICH. Included within this definition 

20 are polymorphisms which may or may not be readily detectable using a particular oligonucleotide 
probe of the polynucleotide encoding TRICH, and improper or unexpected hybridization to allelic 
variants, with a locus other than the normal chromosomal locus for the polynucleotide sequence 
encoding TRICH. The encoded protein may also be "altered," and may contain deletions, 
insertions, or substitutions of amino acid residues which produce a silent change and result in a 

25 functionally equivalent TRICH. Deliberate amino acid substitutions may be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues, as long as the biological or immunological activity of TRICH is retained. For 
example, negatively charged amino acids may include aspartic acid and glutamic acid, and 
positively charged amino acids may include lysine and arginine. Amino acids with uncharged polar 

30 side chains having similar hydrophilicity values may include: asparagine and glutamine; and serine 
and threonine. Amino acids with uncharged side chains having similar hydrophilicity values may 
include: leucine, isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or 

35 synthetic molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally 
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occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino 
acid sequence to the complete native amino acid sequence associated with the recited protein 
molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
5 Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological 
activity of TRICH. Antagonists may include proteins such as antibodies, nucleic acids, 
carbohydrates, small molecules, or any other compound or composition which modulates the 

10 activity of TRICH either by directly interacting with TRICH or by acting on components of the 
biological pathway in which TRICH participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments ' 
thereof, such as Fab, F(ab% and Fv fragments, which are capable of binding an epitopic 
determinant. Antibodies that bind TRICH polypeptides can be prepared using intact polypeptides or 

15 using fragments containing small peptides of interest as the immunizing antigen. The polypeptide 
or oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from 
the translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if 
desired. Commonly used earners that are chemically coupled to peptides include bovine serum 
albumin, thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used 

20 to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 

25 on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a 
specific molecular target. Aptamers are derived from an in vitro evolutionary process (e.g., SELEX 
(Systematic Evolution of Ligands by Exponential Enrichment), described in U.S. Patent No. 

30 5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries. 
Aptamer compositions may be double-stranded or single-stranded, and may include 
deoxyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. 
The nucleotide components of an aptamer may have modified sugar groups (e.g., the 2-OH group of 
a ribonucleotide may be replaced by 2'-F or 2'-NH 2 ), which may improve a desired property, e.g., 

35 resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 
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e.g., a high molecular weight carrier to slow clearance of the aptamer from the circulatory system. 
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a 
cross-linker. (See, e.g., Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13.) 

The term "intramer" refers to an aptamer which is expressed in vivo . For example, a 
5 vaccinia virus-based RNA expression system has been used to express specific RNA aptamers at 
high levels in the cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Natl Acad. Sci. USA 
96:3606-3610). 

The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left- 
handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 

10 nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on 
substrates containing right-handed nucleotides. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" *' 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; 
RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 

15 phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-methyl cytosine, 2 ? -deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 

20 nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or 
biochemical functions of a naturally occurring molecule. Likewise, "immunologically active" or 

25 "immunogenic" refers to the capability of the natural, recombinant, or synthetic TRICH, or of any 
oligopeptide thereof, to induce a specific immune response in appropriate animals or cells and to 
bind with specific antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5-AGT-3' pairs with its complement, 

30 3'-TCA-5'. 

A "composition comprising a given polynucleotide sequence" and a "composition 
comprising a given amino acid sequence" refer broadly to any composition containing the given 
polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an 
aqueous solution. Compositions comprising polynucleotide sequences encoding TRICH or 
35 fragments of TRICH may be employed as hybridization probes. The probes may be stored in freeze- 
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dried form and may be associated with a stabilizing agent such as a carbohydrate. In hybridizations, 
the probe may be deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., 
sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's solution, dry milk, salmon 
sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to 
repeated DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit 
(Applied Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which 
has been assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a 
computer program for fragment assembly, such as the GELVD3W fragment assembly system (GCG, 
Madison WI) or Phrap (University of Washington, Seattle WA). Some sequences have been both 
extended and assembled lo produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least ' 
interfere with the properties of the original protein, i.e., the structure and especially the function of 
the protein is conserved and noi significantly changed by such substitutions. The table below shows 
amino acids which may be substituted for an original amino acid in a protein and which are 
regarded as conservative amino acid substitutions. 



Original Residue 


Conservative Substitution 


Ala 


Gly, Ser 


Arg 


His, Lys 


Asn 


Asp, Gin, His 


Asp 


Asn, Glu 


Cys 


Ala, Ser 


Gin 


Asn, Glu, His 


Glu 


Asp, Gin, His 


Gly 


Ala 


His 


Asn, Arg, Gin, Glu 


lie 


Leu, Val 


Leu 


Be, Val 


Lys 


Arg, Gin, Glu 


Met 


Leu, lie 


Phe 


His, Met, Leu, Trp, Tyr 


Ser 


Cys, Thr 


Thr 


Ser, Val 


Trp 


Phe, Tyr 


Tyr 


His, Phe, Trp 


Val 


He, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the 
polypeptide backbone in the area of the substitution, for example, as a beta sheet or alpha helical 
conformation, (b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or 
(c) the bulk of the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
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absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by 
an alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which 
5 retains at least one biological or immunological function of the natural molecule. A derivative 
polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least 
one biological or immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

10 "Differential expression" refers to increased or upregulated; or decreased, downregulated, 

. or absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, for example, a treated and an untreated sample, or a . ✓ 
diseased and a normal sample. . 

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 

15 exon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 

A "fragment" is a unique portion of TRICH or the polynucleotide encoding TRICH which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise 

20 up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For 
example, a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. 
A fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at 
least 5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or 
amino acid residues in length. Fragments may be preferentially selected from certain regions of a 

25 molecule. For example, a polypeptide fragment may comprise a certain length of contiguous amino 
acids selected from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as 
shown in a certain defined sequence. Clearly these lengths are exemplary, and any length that is 
supported by the specification, including the Sequence Listing, tables, and figures, may be 
encompassed by the present embodiments. 

30 A fragment of SEQ ID NO:33-64 comprises a region of unique polynucleotide sequence 

that specifically identifies SEQ ID NO:33-64, for example, as distinct from any other sequence in 
the genome from which the fragment was obtained. A fragment of SEQ ID NO:33-64 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:33-64 from related polynucleotide sequences. The precise length of a fragment of SEQ 

35 ID NO:33-64 and the region of SEQ ID NO:33-64 to which the fragment corresponds are routinely 
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determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO: 1-32 is encoded by a fragment of SEQ ID NO:33-64. A 
fragment of SEQ ID NO: 1-32 comprises a region of unique amino acid sequence that specifically 
identifies SEQ ID NO: 1-32. For example, a fragment of SEQ ID NO: 1-32 is useful as an 
5 immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO: 1- 
32. The precise length of a fragment of SEQ ID NO:l-32 and the region of SEQ ID NO:l-32 to 
which the fragment corresponds are routinely determinable by one of ordinary skill in the art based 
on the intended purpose for the fragment. 

A "full length" polynucleotide sequence is one containing at least a translation initiation 
10 codon (e.g.. methionine) followed by an open reading frame and a translation termination codon. A 
"full length" polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between ' 
two or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and «% identity," as applied to polynucleotide sequences, refer 
15 to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 
in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
20 parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e 

sequence alignment program. This program is part of the LASERGENE software package, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 
Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) 
CABIOS 8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters 
25 are set as follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted- 
residue weight table is selected as the default. Percent identity is reported by CLUSTAL V as the 
"percent similarity" between aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison 
algorithms is provided by the National Center for Biotechnology Information (NCBI) Basic Local 
30 Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403^10), which is 
available from several sources, including the NCBL Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence 
analysis programs including "blastn," that is used to align a known polynucleotide sequence with 
other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 
35 2 Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
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Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. 
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool 
5 Version 2.0. 12 (April-2 1-2000) set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 
10 Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 • 

Expect: 10 . ✓ 

Word Size: 11 
Filter: on 

15 Percent identity may be measured over the length of an entire defined sequence, for 

example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined sequence, for instance, a 
fragment of at least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 
contiguous nucleotides. Such lengths are exemplary only, and it is understood that any fragment 

20 length supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that 
changes in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic 

25 acid sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer 
to the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 

30 substitutions, explained in more detail above, generally preserve the charge andjiydrophobicity at 
the site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e 
sequence alignment program (described and referenced above). For pairwise alignments of 

35 polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
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penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
5 comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (April-2 1-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 

Open Gap; 11 and Extension Gap: 1 penalties 
10 Gap x drop-off: 50 

Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 

15 for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, 
for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at : 
least 150 contiguous residues. Such lengths are exemplary only, and it is understood that any 
fragment length supported by the sequences shown herein, in the tables, figures or Sequence 

20 Listing, may be used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. - ( 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 

25 sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of 

30 complementarity. Specific hybridization complexes form under permissive annealing conditions 
and remain hybridized after the "washing" step(s). The washing step(s) is particularly important in 
determining the stringency of the hybridization process, with more stringent conditions allowing 
less non-specific binding, i.e., binding between pairs of nucleic acid strands that are not perfectly 
matched. Permissive conditions for annealing of nucleic acid sequences are routinely determinable 

35 by one of ordinary skill in the art and may be consistent among hybridization experiments, whereas 
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wash conditions may be varied among experiments to achieve the desired stringency, and therefore 
hybridization specificity. Permissive annealing conditions occur, for example, at 68°C in the 
presence of about 6 x SSC, about 1% (w/v) SDS, and about 100 /ig/ml sheared, denatured salmon 
sperm DNA. 

5 Generally, stringency of hybridization is expressed, in part, with reference to the 

temperature under which the wash step is carried out. Such wash temperatures are typically 
selected to be about 5°C to 20°C lower than the thermal melting point (TJ for the specific sequence 
at a defined ionic strength and pH. The T m is the temperature (under defined ionic strength and pH) 
at which 50% of the target sequence hybridizes to a perfectly matched probe. An equation for 

10 calculating T m and conditions for nucleic acid hybridization are well known and can be found in 
Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold Spring 
Harbor Press, Plainview NY; specifically see volume 2, chapter 9. . <* 

High stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, 

15 for 1 hour. Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC 
concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. 
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents 
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 /ig/ml. Organic 
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under 

20 particular circumstances, such as for RNArDNA hybridizations. Useful variations on these wash 
conditions will be readily apparent to those of ordinary skill in the art. Hybridization, pjarticiilarly 
under high stringency conditions, may be suggestive of evolutionary similarity between the 
nucleotides. Such similarity is strongly indicative of a similar role for the nucleotides arid their 
encoded polypeptides. 

25 The term "hybridization complex" refers to a complex formed between two nucleic acid 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C 0 t or I^t analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate 

30 substrate to which cells or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

. "Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 

35 of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
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cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TRICH which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide 
5 fragment of TRICH which is useful in any of the antibody production methods disclosed herein or 
known in the art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
10 chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of TRICH. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other <• 
biological, functional, or immunological properties of TRICH. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, 
15 oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA 
of genomic or synthetic origin which may be single-stranded or double-stranded and may represent 
the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like 
material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
20 functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

'Teptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
25 comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone 
of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an TRICH may involve lipidation, glycosylation, 
30 phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in 
the art. These processes may occur synthetically or biochemically. Biochemical modifications will 
vary by cell type depending on the enzymatic milieu of TRICH. 

"Probe" refers to nucleic acid sequences encoding TRICH, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
35 isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
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Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 
"Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 

5 identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 
contiguous nucleotides of a known sequence. In order to enhance specificity, longer probes and 
primers may also be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 
60, 70, 80, 90, 100, or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. 

10 Probes and primers may be considerably longer than these examples, and it is understood that any 
length supported by the specification, including the tables, figures, and Sequence Listing, may be 
used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold 

15 Spring Harbor Press, Plainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular 
Biology . Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR 
Protocols, A Guide to Methods and Applications . Academic Press, San Diego CA. PCR primer 
pairs can be derived from a known sequence, for example, by using computer programs intended for 
that purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, 

20 Cambridge MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 lalobases. Similar primer 

25 selection programs have incorporated additional features for expanded capabilities. For example, 
the PrimOU primer selection program (available to the public from the Genome Center at University 
of Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from 
megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 
primer selection program (available to the public from the Whitehead Institute/MIT Center for 

30 Genome Research, Cambridge MA) allows the user to input a "misprinting library," in which 
sequences to avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for 
the selection of oligonucleotides for microarrays. (The source code for the latter two primer 
selection programs may also be obtained from their respective sources and modified to meet the 
user's specific needs.) The PrimeGen program (available to the public from the UK Human 

35 Genome Mapping Project Resource Centre, Cambridge UK) designs primers based on multiple 
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sequence alignments, thereby allowing selection of primers that hybridize to either the most 
conserved or least conserved regions of aligned nucleic acid sequences. Hence, this program is 
useful for identification of both unique and conserved oligonucleotides and polynucleotide 
fragments. The oligonucleotides and polynucleotide fragments identified by any of the above 
5 selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited 
to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 

10 that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, 

15 a recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
20 expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
translation, or RNA stability. 
25 "Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 

amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
30 sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of 
the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of 
ribose instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH, 
nucleic acids encoding TRICH, or fragments thereof may comprise a bodily fluid; an extract from a 
35 cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
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cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular 
5 structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A " the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled 
A and the antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
10 removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with 
which they are naturally associated. . ✓ 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
by different amino ockJ residues or nucleotides, respectively. 
15 "Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 

chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" or "expression profile" refers to the collective pattern of gene 
20 expression by a particular cell type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a 
recipient cell. Transformation may occur under natural or artificial conditions according to various 
methods well known in the art, and may rely on any known method for the insertion of foreign 
nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for transformation is * 
25 selected based on the type of host cell being transformed and may include, but is not limited to, 
bacteriophage or viral infection, electroporation, heat shock, lipofection, and particle bombardment. 
The term "transformed cells" includes stably transformed cells in which the inserted DNA is 
capable of replication either as an autonomously replicating plasmid or as part of the host 
chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for 
30 limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a 
35 precursor of the cell, by way of deliberate genetic manipulation, such as by microinjection or by 
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infection with a recombinant vims. The term genetic manipulation does not include classical 
cross-breeding, or in vitro fertilization, but rather is directed to the introduction of a recombinant 
DNA molecule. The transgenic organisms contemplated in accordance with the present invention 
include bacteria, cyanobacteria, fungi, plants and animals. The isolated DNA of the present 
5 invention can be introduced into the host by methods known in the art, for example infection, 
transfection, transformation or transconjugation. Techniques for transferring the DNA of the 
present invention into such organisms are widely known and provided in references such as 
Sambrook et al. (1989), supra. 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence 

10 having at least 40% sequence identity to the particular nucleic acid sequence over a certain length of 
one of the nucleic acid sequences using blastn with the "BLAST 2 Sequences'* tool Version 2.0.9 
(May-07-1999) set at default parameters. Such a pair of nucleic acids may show, for example, at . ✓ 
least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, or at least 98%, or greater 

15 sequence identity over a certain defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may 
have significant identity to a reference molecule, but will generally have a greater or lesser number 
of polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 

20 reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides will generally have significant amino acid identity relative to 
each other. A polymorphic variant is a variation al the polynucleotide sequence of a particular gene 
between individuals of a given species. Polymorphic variants also may encompass "single 
nucleotide polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide 

25 base. The presence of SNPs may be indicative of, for example, a certain population, a disease state, 
or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence 
having at least 40% sequence identity to the particular polypeptide sequence over a certain length of 
one of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 

30 (May-07-1999) set at default parameters. Such a pair of polypeptides may show, for example, at 
least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, or at least 98%, or greater sequence 
identity over a certain defined length of one of the polypeptides. 

35 THE INVENTION 
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The invention is based on the discovery of new human transporters and ion channels 
(TRICH), the polynucleotides encoding TRICH, and the use of these compositions for the diagnosis, 
treatment, or prevention of transport, neurological, muscle, immunological and cell proliferative 
disorders. 

5 Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 

sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to 
a single Incyte project identification number (incyte Project ID). Each polypeptide sequence is 
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an 
Incyte polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide 

10 sequencers denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ 
ID NO:) and an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as 
shown. ^ 

Table 2 shows sequences with homology to the polypeptides of the invention as identified 
by BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the 

15 polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding 
Incyte polypeptide sequence number (Incyte Polypeptide ID) for polypeptides of the invention. 
Column 3 shows the GenBank identification number (GenBank ID NO:) of the nearest GenBank 
homolog. Column 4 shows the probability scores for the matches between each polypeptide and its 
homolog(s). Column 5 shows the annotation of the GenBank homolog(s) along with relevant 

20 citations where applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 
and 2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding " 
Incyte polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. 
Column 3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential 

25 phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the 
MOTIFS program of the GCG sequence analysis software package (Genetics Computer Group, 
Madison WI). Column 6 shows amino acid residues comprising signature sequences, domains, and 
motifs. Column 7 shows analytical methods for protein structure/function analysis and in some 
cases, searchable databases to which the analytical methods were applied. 

30 Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and 

these properties establish that the claimed polypeptides are transporters and ion channels. For 
example, SEQ ID NO:l is 83% identical to human sodium-hydrogen exchanger 6 (GenBank ID 
g2944233) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The 
BLAST probability score is 5.1e-242, which indicates the probability of obtaining the observed 

35 polypeptide sequence alignment by chance. SEQ ID NO:l also contains a sodium/hydrogen 

35 



iSDOCID: <WO 024641 5A2_I_> 



WO 02/46415 



PCT/US01/46963 



exchanger family domain as determined by searching for statistically significant matches in the 
hidden Markov model (HMM)-based PFAM database of conserved protein family domains. (See 
Table 3.) Data from BLIMPS, and other BLAST analyses provide further corroborative evidence 
that SEQ ID NO.l is a sodium-hydrogen exchange transporter. In another example, SEQ ID NO:7 
5 is 85% identical to Rattus norveeicus Na7,K + -ATPase alpha subunit (GenBarik ID g6 199 15) as 
determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 
probability score is 0.0, which indicates the probability of obtaining the observed polypeptide 
sequence alignment by chance. SEQ ID NO:7 also contains an E1-E2 ATPase domain as 
determined by searching for statistically significant matches in the hidden Markov model (HMM)- 
10 based PFAM database of conserved protein family domains. (See Table 3.) Data from BLIMPS, 
MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO:7 
is a cation-transporting ATPase. In yet another example, SEQ ID NO: 13 is 77% identical to a . s 
human carrier-like prolan (GenBank ID g3694661) as determined by the Basic Local Alignment 
Search Tool (BLAST). (See Table 2.) The BLAST probability score is 5.5e-209, which indicates 

15 the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID 

NO: 13 also contains a mitochondrial energy transfer protein domain as determined by searching for 
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of 
conserved protein family domains. (See Table 3.) The presence of this domain is confirmed by 
BLIMPS, MOTIFS, and PROFILESCAN analyses, providing further corroborative evidence that 

20 SEQ ID NO: 13 is a transporter. Further, SEQ ID NO: 16 is 41% identical to human novel ATPase 
(GenBank ID g8979S01 ) as determined by the Basic Local Alignment Search Tool (BLAST). (See 
Table 2.) The BLAST probability score is l.le-165, which indicates the probability of obtaining the 
observed polypeptide sequence alignment by chance. SEQ ID NO: 16 also contains an E1-E2 
ATPase domain as determined by searching for statistically significant matches in the hidden 

25 Markov model (HMM)-based PFAM database of conserved protein family domains. (See Table 3.) 
Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide further coiroborative evidence 
that SEQ ID NO: 16 is a cation-transporting ATPase. In a further example, SEQ ID NO: 19 is 43% 
identical to Sinorhizobium sp. As4 ArsA, the catalytic subunit of the arsenic oxyanion-translocating 
ATPase (GenBank ID g5S02945) as determined by the Basic Local Alignment Search Tool 

30 (BLAST). (See Table 2.) The BLAST probability score is 7.7e-125, which indicates the probability 
of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO: 19 also contains 
an anion-transporting ATPase domain as determined by searching for statistically significant 
matches in the hidden Markov model (HMM)-based PFAM database of conserved protein family 
domains. (See Table 3.) Data from additional BLAST and PROFILESCAN analyses provide 

35 further corroborative evidence that SEQ ID NO: 19 is an anion-transporting ATPase. In yet a further 
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example, SEQ ID NO:21 is 54% identical to a murine putative E1-E2 ATPase (GenBank ID 
g28577) as detemrined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The 
BLAST probability score is 5.2e-190, which indicates the probability of obtaining the observed 
polypeptide sequence alignment by chance. SEQ ID NO:21 also contains six transmembrane 
5 domains as determined using TMAP, a program which delineates transmembrane segments. (See 
Table 3.) Data from BLIMPS, and MOTIFS, analyses provide further corroborative evidence that 
SEQ ID NO:21 is an ATPase. In a father example, SEQ ID NO:24 is 52% identical, from residue 
A77 to residue L1007, to rat NMDAR-L (GenBank ID g2160125) as determined by the Basic Local 
Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is L5e-262, which 

10 indicates the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ 
ID NO:24 also contains a ligand gated ion channel domain as determined by searching for 
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of . ✓ 
conserved protein family domains. (See Table 3.) Data from BLIMPS analysis provide further 
corroborative evidence that SEQ ID NO:24 is a glutamate receptor. SEQ ID NO:2-6, SEQ ID NO:8- 

15 12, SEQ ID NO: 14-15, SEQ ID NO: 17-18, SEQ ID NO:20, SEQ ID NO:22-23, and SEQ ID NO:25- 
32 were analyzed and annotated in a similar manner. The algorithms and parameters for the 
analysis of SEQ ID NO: 1-32 are described in Table 7. 

As shown in Table 4, the full length polynucleotide sequences of the present invention were 
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 

20 combination of these two types of sequences. Column 1 lists the polynucleotide sequence 
identification number (Polynucleotide SEQ ID NO:), the corresponding Incyte polynucleotide 
consensus sequence number (Incyte ID) for each polynucleotide of the invention, and the length of 
each polynucleotide sequence in basepairs. Column 2 shows the nucleotide start (5') and stop (3') 
positions of the cDNA and/or genomic sequences used to assemble the full length polynucleotide 

25 sequences of the invention, and of fragments of the polynucleotide sequences which are useful, for 
example, in hybridization or amplification technologies that identify SEQ ID NO:33-64 or that 
distinguish between SEQ ID NO:33-64 and related polynucleotide sequences. 

The polynucleotide fragments described in Column 2 of Table 4 may refer specifically, for 
example, to Incyte cDNAs derived from tissue-specific cDNA libraries or from pooled cDNA 

30 libraries. Alternatively, the polynucleotide fragments described in column 2 may refer to GenBank 
cDNAs or ESTs which contributed to the assembly of the full length polynucleotide sequences. In 
addition, the polynucleotide fragments described in column 2 may identify sequences derived from 
the ENSEMBL (The Sanger Centre, Cambridge, UK) database (Le., those sequences including the 
designation "ENST"). Alternatively, the polynucleotide fragments described in column 2 may be 

35 derived from the NCBI RefSeq Nucleotide Sequence Records Database (I e. , those sequences 
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including the designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i.e., 
those sequences including the designation "NP"). Alternatively, the polynucleotide fragments 
described in column 2 may refer to assemblages of both cDNA and Genscan-predicted exons 
brought together by an "exon stitching" algorithm. For example, a polynucleotide sequence 
5 identified as FLJCXXXXXJSf I _N 2 jm r YY^N 3 _N 4 represents a "stitched" sequence in which 
XXXXXX is the identification number of the cluster of sequences to which the algorithm was 
applied, and YYYYY is the number of the prediction generated by the algorithm, and N lt2 ^ if 
present, represent specific exons that may have been manually edited during analysis (See Example 
V). Alternatively, the polynucleotide fragments in column 2 may refer to assemblages of exons 

10 brought together by an "exon-stretching" algorithm. For example, a polynucleotide sequence 
identified as FUCXXXXX_gAAAAA_gBBBBB_l_N is a "stretched" sequence, with XXXXXX being 
the Incyte project identification number, gAAAAA being the GenBank identification number of the ' 
human genomic sequence to which the "exon-stretching" algorithm was applied, gBBBBB being the 
GenBank identification number or NCBI RefSeq identification number of the nearest GenBank 

15 protein homolog, and N referring to specific exons (See Example V). In instances where a RefSeq 
sequence was used as a protein homolog for the "exon-stretching" algorithm, a RefSeq identifier 
(denoted by "NM," "NP," or "NT") may be used in place of the GenBank identifier (Le. y gBBBBB). 

Alternatively, a prefix identifies component sequences that were hand-edited, predicted 
from genomic DNA sequences, or derived from a combination of sequence analysis methods. The 

20 following Table lists examples of component sequence prefixes and corresponding sequence 
analysis methods associated with the prefixes (see Example IV and Example V). 



Prefix 


Type of analysis and/or examples of programs 


GNN, GFG, 
ENST 


Exon prediction from genomic sequences using, for example, 
GENSCAN (Stanford University, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK). 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Example V). 


INCY 


Full length transcript and exon prediction from mapping of EST 
sequences to the genome. Genomic location and EST composition 
data are combined to predict the exons and resulting transcript. 



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in 
30 Table 4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant 
Incyte cDNA identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotide 
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sequences which were assembled using Incyte cDNA sequences. The representative cDNA library 
is the Incyte cDNA library which is most frequently represented by the Incyte cDNA sequences 
which were used to assemble and confirm the above polynucleotide sequences. The tissues and 
vectors which were used to construct the cDNA libraries shown in Table 5 are described in Table 6. 

5 The invention also encompasses TRICH variants. A preferred TRICH variant is one which 

has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the TRICH amino acid sequence, and which contains at least one functional or 
structural characteristic of TRICH. 

The invention also encompasses polynucleotides which encode TRICH. In a particular 

10 embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:33-64, which encodes TRICH. The polynucleotide 
sequences of SEQ ID NO:33-64, as presented in the Sequence Listing, embrace the equivalent RNA^ 
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

15 The invention also encompasses a variant of a polynucleotide sequence encoding TRICH. 

In particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at 
least about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding TRICH. A particular aspect of the invention encompasses a variant of a 
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID 

20 NO:33-64 which has at least about 70%, or alternatively at least about 85%, or even at least about 
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting 
of SEQ ID NO:33-64. Any one of the polynucleotide variants described above can encode an amino 
acid sequence which contains at least one functional or structural characteristic of TRICH. 

In addition, or in the alternative, a polynucleotide variant of the invention is a splice variant 

25 of a polynucleotide sequence encoding TRICH. A splice variant may have portions which have 
significant sequence identity to the polynucleotide sequence encoding TRICH, but will generally 
have a greater or lesser number of polynucleotides due to additions or deletions of blocks of 
sequence arising from alternate splicing of exons during mKNA processing. A splice variant may 
have less than about 70%, or alternatively less than about 60%, or alternatively less than about 50% 

30 polynucleotide sequence identity to the polynucleotide sequence encoding TRICH over its entire 
length; however, portions of the splice variant will have at least about 70%, or alternatively at least 
, about 85%, or alternatively at least about 95%, or alternatively 100% polynucleotide sequence 
identity to portions of the polynucleotide sequence encoding TRICH. Any one of the splice variants 
described above can encode an amino acid sequence which contains at least one functional or 

35 structural characteristic of TRICH. 
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It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding TRICH, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
5 sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring TRICH, and all such variations are to be considered 
as being specifically disclosed. 

Although nucleotide sequences which encode TRICH and its variants are generally capable 
10 of hybridizing to the nucleotide sequence of the naturally occurring TRICH under appropriately 
selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding 
TRICH or its derivatives possessing a substantially different codon usage, e.g., inclusion of non- ' 
naturally occurring codons. Codons may be selected to increase the rate at which expression of the 
peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with 
15 which particular codons are utilized by the host. Other reasons for substantially altering the 

nucleotide sequence encoding TRICH and its derivatives without altering the encoded amino acid 
sequences include the production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode TRICH and 
20 TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding TRICH or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
25 hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:33-64 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

30 Methods for DNA sequencing are well known in the art and may be used to practice any of 

the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NI), or 
combinations of polymerases and proofreading exonucleases such as those found in the 

35 ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence 
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preparation is automated with machines such as the MICROLAB 2200 liquid transfer system 
(Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI 
CATALYST 800 thermal cycler (Applied Biosystems). Sequencing is then carried out using either 
the ABI 373 or 377 DNA sequencing system (Applied Biosystems), the MEGABACE 1000 DNA 
5 sequencing system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The 
resulting sequences are analyzed using a variety of algorithms which are well known in the art. 
(See, e.g., Ausubel, F.M. (1997) Short Protocols in Molecular Biology , John Wiley & Sons, New ' 
York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and Biotechnology . Wiley VCH, New 
York NY, pp. 856-853.) 

10 The nucleic acid sequences encoding TRICH may be extended utilizing a partial nucleotide 

sequence and employing various PCR-based methods known in the art to detect upstream , 
sequences, such as promoters and regulatory elements. For example, one method which may be . ✓ 
employed, restriction-site PCR, uses universal and nested primers to amplify unknown sequence 
from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 

15 2:3 18-322.) Another method, inverse PCR, uses primers that extend in divergent directions to 
amplify unknown sequence from a circularized template. The template is derived from restriction 
fragments comprising a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et 
al. (1988) Nucleic Acids Res. 16:8186.) A third method, capture PCR, involves PCR amplification 
of DNA fragments adjacent to known sequences in human and yeast artificial chromosome DNA. 

20 (See, e.g., Lagerstrom, M. et al. (199 1) PCR Methods Applic. 1:111-119.) In this method, multiple ' 
restriction enzyme digestions and ligations may be used to insert an engineered double-stranded 
sequence into a region of unknown sequence before performing PCR. Other methods which may be 
used to retrieve unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) 
Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR, nested primers, and 

25 PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic DNA. This procedure 
avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based 
methods, primers may be designed using commercially available software, such as OLIGO 4.06 
primer analysis software (National Biosciences, Plymouth MN) or another appropriate program, to 
be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal 

30 to the template at temperatures of about 68°C to 72°C. 

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which ah oligo ld(T) ' 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of 

35 sequence into 5' non-transcribed regulatory regions. 
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Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
5 emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

10 In another embodiment of the invention, polynucleotide sequences or fragments thereof 

which encode TRICH may be cloned in recombinant DNA molecules that direct expression of 
TRICH, or fragments or functional equivalents thereof, in appropriate host cells. Due to the ^ 
inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same 
or a functionally equivalent amino acid sequence may be produced and used to express TRICH. 

15 The nucleotide sequences of the present invention can be engineered using methods 

generally known in the art in order to alter TRICH-encoding sequences for a variety of purposes 
including, but not limited to, modification of the cloning, processing, and/or expression of the gene 
product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments and 
synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, 
20 oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create 
new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, 
and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques 
such as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 

25 5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians; F.C. et al. (1999) 
Nat. Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
improve the biological properties of TRICH, such as its biological or enzymatic activity or its 
ability to bind to other molecules or compounds. DNA shuffling is a process by which a library of 
gene variants is produced using PCR-mediated recombination of gene fragments. The library is 

30 then subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds 
of DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing 
random point mutations may be recombined, screened, and then reshuffled until the desired 

35 properties are optimized. Alternatively, fragments of a given gene may be recombined with 
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fragments of homologous genes in the same gene family, either from the same or different species, 
thereby maximizing the genetic diversity of multiple naturally occurring genes in a directed and 
controllable manner. 

In another embodiment, sequences encoding TRICH may be synthesized, in whole or in 
5 part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) 
Nucleic Acids Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 
7:225-232.) Alternatively, TRICH itself or a fragment thereof may be synthesized using chemical 
methods. For example, peptide synthesis can be performed using various solution-phase or 
solid-phase techniques. (See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular 
10 Properties , WH Freeman, New York NY, pp. 55-60; and Roberge, J.Y. et al. (1995) Science 
269:202-204.) Automated synthesis may be achieved using the ABI 43 1 A peptide synthesizer 
(Applied Biosystems). Additionally, the amino acid sequence of TRICH, or any part thereof, may ; 
be altered during direct synthesis and/or combined with sequences from other proteins, or any part 
thereof, to produce a variant polypeptide or a polypeptide having a sequence of a naturally occurring 
15 polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by 
sequencing. (See, e.g., Creighton, simra, pp. 28-53.) 

20 In order to express a biologically active TRICH, the nucleotide sequences encoding TRICH ' 

or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which 
contains the necessary elements for transcriptional and translational control of the inserted coding 
sequence in a suitable host. These elements include regulatory sequences, such as enhancers, 
constitutive and inducible promoters, and 5* and 3' untranslated regions in the vector and in 

25 polynucleotide sequences encoding TRICH. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding TRICH. Such signals include the ATG initiation codon and adjacent 
sequences, e.g. the Kozak sequence. In cases where sequences encoding TRICH and its initiation 
codon and upstream regulatory sequences are inserted into the appropriate expression vector, no 

30 additional transcriptional or translational control signals may be needed. However, in cases where 
only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals 
including an in-frame ATG initiation codon should be provided by the vector. Exogenous 
translational elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by-the inclusion of enhancers,appropriate for the 

35 particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
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20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding TRICH and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA techniques, 
5 synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. etal. (1989) 
Molecular Cloning A Laboratory Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 
16-17; Ausubel, F.M. et al. (1995) Current Protocols in Molecular Biology . John Wiley & Sons, 
New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express 

10 sequences encoding TRICH. These include, but are not limited to, microorganisms such as bacteria 
transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected with viral expression vectors 
(e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., cauliflower 
mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti 

15 or pBR322 plasmids); or animal cell systems. (See, e.g., Sambrook, supra : Ausubel, supra : Van 
Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. (1994) 
Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; 
Takamatsu, N. (1987) EMBO J. 6:307-311; The McGraw Hill Yearbook of Science and Technology 
(1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. 

20 Sci. USA 81:3655-3659; and Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.) Expression 
vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various 
bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ, tissue, or 
cell population. (See, e.g., Di Nicola, M: et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et 
al. (1993) Proc. Natl. Acad. Sci. USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 

25 317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 31(3):219-226; and Verma, I.M. 
and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding TRICH. For example, routine 
cloning, subcloning, and propagation of polynucleotide sequences encoding TRICH can be achieved 

30 using a multifunctional E. coli vector such as PBLUESCRBT (Stratagene, La Jolla CA) or 

PSPORT1 plasmid (Life Technologies). Ligation of sequences encoding TRICH into the vector's 
multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for 
identification of transformed bacteria containing recombinant molecules. In addition, these vectors 
may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, 

35 and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. 
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Schuster (1989) J. Biol. Chem. 264:5503-5509.) When large quantities of TRICH are needed, e.g. 
for the production of antibodies, vectors which direct high level expression of TRICH may be used. 
For example, vectors containing the strong, inducible SP6 or T7 bacteriophage promoter may be 
used. 

5 Yeast expression systems may be used for production of TRICH. A number of vectors 

containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 
vectors direct cither the secretion or intracellular retention of expressed proteins and enable 
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 

10 1995, supra ; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. 
(1994)Bio/Tcchhology 12:18I-IS4.) 

Plant systems may also be used for expression of TRICH. Transcription of sequences ✓ 
encoding TRICH may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO 

15 J. 6:307-311). Alternatively, plant promoters such as the small subunit of.RUBISCO or heat shock 
promoters may be u*ed. (Sec. e.g.. Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et 
al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) 
These constructs can be introduced into plant cells by direct DNA transformation or 
^pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology 

20 (1992) McGraw Hill. New York NY, pp. 191496.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding TRICH may be ligated 
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to 

25 obtain infective virus which expresses TRICH in host cells. (See, e.g., Logan, J. and T. Shenk 

(1984) Proc. Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the 
Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. 
SV40 or EBV-based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 

30 DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, JJ. et al. (1997) Nat. Genet. 
15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression ' 
35 of TRICH in cell lines is preferred. For example, sequences encoding TRICH can be transformed 
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into cell lines using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vector. 
Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in 
enriched media before being switched to selective media. The purpose of the selectable marker is to 
5 confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
10 phosphoribosyltransferase genes, for use in tk and apr cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, 
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance ' 
to methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and 
pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 
15 Wigler, M. et al. (1980) Proc. Natl. Acad. ScL USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mol. Biol. 150: 1-14.) Additional selectable genes have been described, e.g., trpB and hisD, 
which alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan 
(1988) Proc. Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green 
fluorescent proteins (GFP; Clontech), 6 glucuronidase and its substrate B-glucuronide, or luciferase 
20 and its substrate luciferin may be used. These markers can be used not only to identify 

transformants, but also to quantify the amount of transient or stable protein expression attributable 
to a specific vector system. (See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest 
is also present, the presence and expression of the gene may need to be confirmed. For example, if 
25 the sequence encoding TRICH is inserted within a marker gene sequence, transformed cells 

containing sequences encoding TRICH can be identified by the absence of marker gene function. 
Alternatively, a marker gene can be placed in tandem with a sequence encoding TRICH under the 
control of a single promoter. Expression of the marker gene in response to induction or selection 
usually indicates expression of the tandem gene as well. 
30 In general, host cells that contain the nucleic acid sequence encoding TRICH and that 

express TRICH may be identified by a variety of procedures known to those of skill in the art. 
These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bipassay or immunoassay techniques which include membrane, solution, 
or chip based technologies for the detection and/or quantification of nucleic acid or protein 
35 sequences. 
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Immunological methods for detecting and measuring the expression of TRICH using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
5 monoclonal antibodies reactive to two non-interfering epitopes on TRICH is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art. 
(See, e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual . APS Press, St. Paul 
MN, Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology . Greene Pub. Associates 
and Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols . 

10 Humana Press, Totowa NJ.) 

. A wide variety of labels and conjugation techniques are known by those skilled in the art 
and may be used in various nucleic acid and amino acid assays. Means for producing labeled ^ 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding TRICH 
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled 

15 nucleotide. Alternatively, the sequences encoding TRICH, or any fragments thereof, may be cloned 
into a vector for the production of an mRNA probe. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by addition of an 
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures 
may be conducted using a variety of commercially available kits, such as those provided by 

20 Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter 
molecules or labels which may be used for ease of detection include radionuclides, enzymes, s 
fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. s 

Host cells transformed with nucleotide sequences encoding TRICH may be cultured under 

25 conditions suitable for the expression and recovery of the protein from cell.culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the ■ 
sequence and/or the vector used. As will be understood by those of skill in the art, expression 
vectors containing polynucleotides which encode TRICH may be designed to contain signal 
sequences which direct secretion of TRICH through a prokaryotic or eukaryotic cell membrane. 

30 In addition, a host cell strain may be chosen for its ability to modulate expression of the 

inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a"prepro" 
or "pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 

35 Different host cells which have specific cellular machinery and characteristic mechanisms for 
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post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
5 sequences encoding TRICH may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric TRICH protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of TRICH activity. Heterologous protein 
and peptide moieties may also facilitate purification of fusion proteins using commercially available 

10 affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding protein (MBP). ihioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, 
FLAG, c-myc. and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of j 
their cognate fusion protein* on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, 
and metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable 

15 immunoaffinity purification of fusion proteins using commercially available monoclonal and 

polyclonal antibodies that specifically recognize these epitope tags. A fusion protein may also be 
engineered to contain a proteolytic cleavage site located between the TRICH encoding sequence and 
the heterologous protein sequence, so that TRICH may be cleaved away from the heterologous 
moiety following purification. Methods for fusion protein expression and purification are discussed 

20 in Ausubel (1995, supra , ch. 10). A variety of commercially available kits may also be used to 
facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled TRICH may be achieved 
in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with 

25 the T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, 35 S-methionine. 

TRICH of the present invention or fragments thereof may be used to screen for compounds 
that specifically bind to TRICH. At least one and up to a plurality of test compounds may be 
screened for specific binding to TRICH. Examples of test compounds include antibodies, 

30 oligonucleotides, proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
TRICH, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or 
a natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) Current Pro tocols in Imm unology 
l(2):_Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which 

35 TRICH binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, 
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the compound can be rationally designed using known techniques. In one embodiment, screening 
for these compounds involves producing appropriate cells which express TRICH, either as a 
secreted protein or on the cell membrane. Preferred cells include cells from mammals, yeast, 
Drosophila, or E. coli. Cells expressing TRICH or cell membrane fractions which contain TRICH 
5 are then contacted with a test compound and binding, stimulation, or inhibition of activity of either 
TRICH or the compound is analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label For example, 
the assay may comprise the steps of combining at least one test compound with TRICH, either in 
10 solution or affixed to a solid support, and detecting the binding of TRICH to the compound. . 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, v 
chemical libraries, or natural product mixtures, and the test compound(s) may be free in solution or 
affixed to a solid support. 

15 TRICH of the present invention or fragments thereof may be used to screen for compounds 

that modulate the activity of TRICH. Such compounds may include agonists, antagonists, or partial 
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 
TRICH activity, wherein TRICH is combined with at least one test compound, and the activity of 
TRICH in the presence of a test compound is compared with the activity of TRICH in the absence 

20 of the test compound. A change in the activity of TRICH in the presence of the test compound is 
indicative of a compound that modulates the activity of TRICH. Alternatively, a test compound is 
combined with an in vitro or cell-free system comprising TRICH under conditions suitable for 
TRICH activity, and the assay is performed. In either of these assays, a test compound which 
modulates the activity of TRICH may do so indirectly and need not come in direct contact with the 

25 test compound. At least one and up to a plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding TRICH or their mammalian homologs 
may be "knocked out" in an animal model system using homologous recombination in embryonic 
stem (ES) cells. Such techniques are well known in the art and are useful for the generation of 
animal models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and ILS. Patent No. 

30 5,767,337.) For example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the 
early mouse embryo and grown in culture. The ES cells are transformed with a vector containing 
the gene of interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; 
Capecchi, M.R. (1989) Science 244: 1288-1292). The vector integrates into the corresponding 
region of the host genome by homologous recombination. Alternatively, homologous 

35 recombination takes place using the Cre-loxP system to knockout a gene of interest in a tissue- or 
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developmental stage-specific manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. 
et al. (1997) Nucleic Acids Res. 25:4323-4330). Transformed ES cells are identified and 
microinjected into mouse cell blastocysts such as those from the C57BL/6 mouse strain. The 
blastocysts are surgically transferred to pseudopregnant dams, and the resulting chimeric progeny 
5 are genotyped and bred to produce heterozygous or homozygous strains. Transgenic animals thus 
generated may be tested with potential therapeutic or toxic agents. 

Polynucleotides encoding TRICH may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate 
cell lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages 

10 differentiate into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, 
J.A. et al. (1998) Science 282: 1 145-1 147). 

Polynucleotides encoding TRICH can also be used to create "knockin" humanized animals-' 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding TRICH is injected into animal ES cells, and the injected 

15 sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and 
the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress TRICH, e.g., by secreting TRICH in its milk, may 
also serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 

20 4:55-74). 

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of TRICH and transporters and ion channels. In addition, examples of tissues 
expressing TRICH can be found in Table 6. Therefore, TRICH appears to play a role in transport, 

25 neurological, muscle, immunological and cell proliferative disorders. In the treatment of disorders 
associated with increased TRICH expression or activity, it is desirable to decrease the expression or 
activity of TRICH. In the treatment of disorders associated with decreased TRICH expression or 
activity, it is desirable to increase the expression or activity of TRICH. 

Therefore, in one embodiment, TRICH or a fragment or derivative thereof may be 

30 administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder 
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's 
muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes 
insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, 

35 normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug 
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resistance, myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, 
peripheral neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with 
transport, e.g., angina, bradyairythrnia, tachyarrythmia, hypertension, Long QT syndrome, 
myocarditis, cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, 
5 mitochondrial myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body 
myositis, infectious myositis, polymyositis, neurological disorders associated with transport, e.g., 
Alzheimer's disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, 
paranoid psychoses, and schizophrenia, and other disorders associated with transport, e.g., 
neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, 

10 Wilson's disease, cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, 
hyperglycemia, hypoglycemia, Grave's disease, goiter, Cushing's disease, Addison's disease, 
glucose-galactose malabsorption syndrome, glycogen storage disease, hypercholesterolemia, ' 
adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn syndrome, von Gierke 
disease, pseudohypoaldosteronism type 1, Liddle's syndrome, cystinuria, iminoglycinuria, Hartup 

15 disease, Fanconi disease, and Bartter syndrome; a neurological disorder such as epilepsy, ischemic 
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 

20 diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 

25 encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and 
other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and 
polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic 

30 paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, seasonal affective 
disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, hemiplegic migraine, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
supranuclear palsy ^ corticbbasal degeneration, and familial frontotemporal dementia; a muscle 
disorder such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's muscular 

35 dystrophy, myotonic dystrophy, central core disease, nemaline myopathy, centronuclear myopathy, 
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lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis, dermatomyositis, 
inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, anaphylactic shock, 
arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension, hypoglycemia, 
myocardial infarction, migraine, pheochromocytoma, and myopathies including encephalopathy, 
5 epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, ophthalmoplegia, acid 
maltase deficiency (AMD, also known as Pompe's disease), generalized myotonia, and myotonia 
congenita; an immunological disorder such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, 
amy loidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune 

10 thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodeimal dystrophy (APECED), 
bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis ' 
fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, 
Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple 

15 sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 

20 helminthic infections, and trauma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 
primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 

25 bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 
thymus, thyroid, and uterus. 

In another embodiment, a vector capable of expressing TRICH or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 

30 expression or activity of TRICH including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified TRICH in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or 
prevent a disorder associated with decreased expression or activity of TRICH including, but not 
limited to, those provided above. 

35 ^ still another embodiment, an agonist which modulates the activity of TRICH may be 
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administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH including, but not limited to, those listed above. 

In a further embodiment, an antagonist of TRICH may be administered to a subject to treat 
or prevent a disorder associated with increased expression or activity of TRICH. Examples of such 
5 disorders include, but are not limited to, those transport, neurological, muscle, immunological and 
cell proliferative disorders described above. In one aspect, an antibody which specifically binds 
TRICH may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for 
bringing a pharmaceutical agent to cells or tissues which express TRICH. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 

10 encoding TRICH may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of TRICH including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be adirdnistered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 

15 by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 

combination of therapeutic agents may act synergistically to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of TRICH may be produced using methods which are generally known in the 

20 art. In particular, purified TRICH may be used to produce antibodies or to screen libraries of 

pharmaceutical agents to identify those which specifically bind TRICH. Antibodies to TRICH may 
also be generated using methods that are well known in the art. Such antibodies may include, but 
are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, 
and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which 

25 inhibit dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with TRICH or with any fragment or oligopeptide 
thereof which has immunogenic properties. Depending on the host species, various adjuvants may 
be used to increase immunological response. Such adjuvants include, but are not limited to, 

30 Freund's, mineral gels such as aluminum hydroxide, and surface active substances such as 

lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among 
adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corvnebacterium parvum are 
especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
35 TRICH have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
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consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are identical to a portion of the amino acid sequence of the natural protein. Short 
stretches of TRICH amino acids may be fused with those of another protein, such as KLH, and 
antibodies to the chimeric molecule may be produced. 
5 Monoclonal antibodies to TRICH may be prepared using any technique which provides for 

the production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV- 
hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. 
(1985) J. Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 
10 80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 62: 109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate . ✓ 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and 
15 Takeda, S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the 

production of single chain antibodies may be adapted, using methods known in the art, to produce 
TRICH-specific single chain antibodies. Antibodies with related specificity, but of distinct 
idiotypic composition, may be generated by chain shuffling from random combinatorial 
immunoglobulin libraries. (See, e.g., Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134- 
20 10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

25 Antibody fragments which contain specific binding sites for TRICH may also be generated. 

For example, such fragments include, but are not limited to, F(ab') 2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 

30 et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening.to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art Such 
immunoassays typically involve the measurement of complex formation between TRICH and its 

35 specific antibody. A two-site, monoclonal-based immunoassay utikzing monoclonal antibodies 
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reactive to two non-interfering TRICH epitopes is generally used, but a competitive binding assay 
may also be employed (Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for TRICH. Affinity is expressed as an 

5 association constant, K^, which is defined as the molar concentration of TRICH-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. 
The K a determined for a preparation of polyclonal antibodies, which are heterogeneous in their 
affinities for multiple TRICH epitopes, represents the average affinity, or avidity, of the antibodies 
for TRICH. The K, determined for a preparation of monoclonal antibodies, which are monospecific 

10 for a particular TRICH epitope, represents a true measure of affinity. High-affinity antibody 
preparations with K a ranging from about 10 9 to 10 12 L/mole are preferred for use in immunoassays 
in which the TRICH-antibody complex must withstand rigorous manipulations. Low-affinity 
antibody preparations with ranging from about 10 6 to 10 7 L/mole are preferred for use in 
immunopurification and similar procedures which ultimately require dissociation of TRICH, 

15 preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical 
Approach , IRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A Practical Guide to 
Monoclonal Antibodies . John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
determine the quality and suitability of such preparations for certain downstream applications. For 

20 example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5^10 mg specific antibody/ml, is generally employed in procedures requiring 
precipitation of TRICH-antibody complexes. Procedures for evaluating antibody specificity, titer, 
and avidity, and guidelines for antibody quality and usage in various applications, are generally 
available. (See, e.g., Catty, supra , and Coligan et al. supra .) 

25 In another embodiment of the invention, the polynucleotides encoding TRICH, or any 

fragment or complement thereof, may be used for therapeutic purposes. In one aspect, 
modifications of gene expression can be achieved by designing complementary sequences or 
antisense molecules (DNA, RNA, PNA, or modified oligonucleotides) to the coding or regulatory 
regions of the gene encoding TRICH. Such technology is well known in the art, and antisense 

30 oligonucleotides or larger fragments can be designed from various locations along the coding or 
control regions of sequences encoding TRICH. (See, e.g., Agrawal, S., ed. (1996) Antisense 
Therapeutics , Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 

35 intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
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complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13): 1288-1296.) Antisense sequences can also be introduced intracellularly through the use of 
viral vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) 
5 Blood 76:271; Ausubel, supra : Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323- 
347;) Other gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, 
and other systems known in the art. (See, e.g., Rossi, JJ. (1995) Br. Med. Bull. 51(l):217-225; 
Boado, RJ. et al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M.C. et al. (1997) Nucleic 
Acids Res. 25(14):2730-2736.) 

10 In another embodiment of the invention, polynucleotides encoding TRICH may be used for 

somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic 
deficiency (e.g., in the cases of severe combined immunodeficiency (SCID)-Xl disease ' 
characterized by X-linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), 
severe combined immunodeficiency syndrome associated with an inherited adenosine deaminase 

15 (ADA) deficiency (Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) 
Science 270:470-475), cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. 
(1995) Hum. Gene Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), 
thalassamias, familial hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX 
deficiencies (Crystal, R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 

20 389:239-242)), (ii) express a conditionally lethal gene product (e.g., in the case of cancers which 
result from unregulated cell proliferation), or (iii) express a protein which affords protection against 
intracellular parasites (e.g., against human retroviruses, such as human immunodeficiency virus 
(HIV) (Baltimore, D. (1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93: 1 1395-1 1399), hepatitis B or C virus (HB V, HCV); fungal parasites, such as Candida 

25 albicans and Paracoccidioides brasiliensis: and protozoan parasites such as Plasmodium falciparum 
^ Trypanosoma cruzi). In the case where a genetic deficiency in TRICH expression or regulation 
causes disease, the expression of TRICH from an appropriate population of transduced cells may 
alleviate the clinical manifestations caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 

30 TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing 
these vectors by mechanical means into TRICH-deficient cells. Mechanical transfer technologies 
for use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 

35 Biochem. 62: 191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. 
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Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of TRICH include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors 
(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla 
5 CA), and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). 
TRICH may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus 
(CMV), Rous sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an 
inducible promoter (e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) 
Proc. Natl. Acad. Sci. USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, 

10 F.M.V. and H.M. Blau ( 199S) Curr. Opin. Biotechnol. 9:451-456), commercially available in the 
T-REX plasmid (Invitrpgcn)); the ccdysone-inducible promoter (available in the plasmids PVGRXR 
and PIND; Invitropen); I he FK506/rapamycin inducible promoter; or the RU486/mifepristone 
inducible promoter (Rossi. F.M.V. and H.M. Blau, supra) ), or (iii) a tissue-specific promoter or the 
native promoter of the endogenous gene encoding TRICH from a normal individual. 

15 Commercially available liposome transformation kits (e.g., the PERFECT LIPID 

TRANSFECTION KIT. available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A J. Eb ( 1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 

20 (1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of 
these standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects 
with respect to TRICH expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding TRICH under the control of an independent promoter or the retrovirus long 

25 terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-resporisive 
element (RRE) along with additional retrovirus ris-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 

30 an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, 
R. et al. (1998) I. Virol. 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ("Method for obtaining 

35 retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 
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discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 
reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4 + T- 
cells), and the return of transduced cells to a patient are procedures well known to persons skilled in 
the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020- 
5 7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; 
Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283- 
2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to cells which have one or more genetic abnormalities with 

10 respect to the expression of TRICH. The construction and packaging of adenovirus-based vectors 
are well known to those with ordinary skill in the art. Replication defective adenovirus vectors have 
proven to be versatile for importing genes encoding immunoregulatory proteins into intact islets in 
the pancreas (Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral 
vectors are described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene 

15 therapy"), hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P. A. et al. 
(1999) Annu. Rev. Nutr. 19:511-544 and Verma, LM. and N. Somia (1997) Nature 18:389:239-242, 
both incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to target cells which have one or more genetic abnormalities with 

20 respect to the expression of TRICH. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing TRICH to cells of the central nervous system, for which HSV 
has a tropism. The construction and packaging of herpes-based vectors are well known to those 
with ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based 
vector has been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. 

25 Eye Res. 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail 
in U.S. Patent No. 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV 
d92 which consists of a genome containing at least one exogenous gene to be transferred to a cell 
under the control of the appropriate promoter for purposes including human gene therapy. Also 

30 taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, 

ICP27 and ICP22. For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, 
H. et al. (1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of 
cloned herpesvirus sequences, the generation of recombinant virus following the transfection of 
multiple plasmids containing different segments of the large herpesvirus genomes, the growth and 

35 propagation of herpesvirus, and the infection of cells with herpesvirus are techniques well known to 
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those of ordinary skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding TRICH to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
5 on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA, 
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic 
activity (e.g., protease and polymerase). Similarly, inserting the coding sequence for TRICH into 

10 the alphavirus genome in place of the capsid-coding region results in the production of a large 
number of TRICH-coding RNAs and the synthesis of high levels of TRICH in vector transduced 
cells. While alphavirus infection is typically associated with cell lysis within a few days, the ability 
to establish a persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis 
virus (SIN) indicates that the lytic replication of alphaviruses can be altered to suit the needs of the 

15 gene therapy application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of 
alphaviruses will allow the introduction of TRICH into a variety of cell types. The specific 
transduction of a subset of cells in a population may require the sorting of cells prior to 
transduction. The methods of manipulating infectious cDNA clones of alphaviruses, performing 
alphavirus cDNA and RNA transfections, and performing alphavirus infections, are well known to 

20 those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, 
inhibition can be achieved using triple helix base-pairing methodology. Triple helix jpairing is 
useful because it causes inhibition of the ability of the double helix to open sufficiently for the 

25 binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances 
using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, 
B.E. and B.I. Carr, Molecular and Immu nologic Approaches . Futura Publishing, Mt. Kisco NY, pp. 
163-177.) A complementary sequence or antisense molecule may also be designed to block 
translation of mRNA by preventing the transcript from binding to ribosomes. 

30 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage 

of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the 
ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. For 
example, engineered hammerhead motif ribozyme molecules may specifically and efficiently 
catalyze endonucleolytic cleavage of sequences encoding TRICH. 

35 Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
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scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU,and GUC Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
5 candidate targets may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of nucleic acid molecules. These include 
techniques for chemically synthesizing oligonucleotides such as solid phase phosphoramidite 
10 chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in vivo 

transcription of DNA sequences encoding TRICH. Such DNA sequences may be incorporated into 
a wide variety of vectors with suitable RNA polymerase promoters such as T7 or SP6. ■ ' 

Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or 
inducibly, can be introduced into cell lines, cells, or tissues. 

15 RNA molecules may be modified to increase intracellular stability and half-life. Possible 

modifications include, hut are not limited to, the addition of flanking sequences at the 5' and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2'O-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as 

20 inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms 
of adenine, cytidinc, guanine, thymine, and uridine which are not as easily recognized by 
endogenous endonuclcases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TRICH. 

25 Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors 

30 or promoters of polynucleotide expression. Thus, in the treatment of disorders associated with 
increased TRICH expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding TRICH may be therapeutically useful, and in the treatment of disorders 
associated with decreased TRICH expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding TRICH may be therapeutically useful. 

35 At least one > up to a plurality, of test compounds may be screened for effectiveness in 
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altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective 
in altering polynucleotide expression; selection from an existing, commercially-available or 
proprietary library of naturally-occurring or non-natural chemical compounds; rational design of a 
5 compound based on chemical and/or structural properties of the target polynucleotide; and selection 
from a library of chemical compounds created combinatorially or randomly. A sample comprising 
a polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. The 
sample may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or 
reconstituted biochemical system. Alterations in the expression of a polynucleotide encoding 

10 TR1C1 1 arc assayed by any method commonly known in the art. Typically, the expression of a 
specific nucleotide is detected by hybridization with a probe having a nucleotide sequence 
complementary to the sequence of the polynucleotide encoding TRICH. The amount of ' 
hybridization may be quantified, thus forming the basis for a comparison of the expression of the 
polynucleotide both with and without exposure to one or more test compounds. Detection of a 

15 change in the expression of a polynucleotide exposed to a test compound indicates that the test 
compound is effective in altering the expression of the polynucleotide. A screen for a compound 
effective in altering expression of a specific polynucleotide can be carried out, for example, using a 
Schizosaccharomvces pombe gene expression system (Atkins, D. et al. (1999) U.S. Patent No. 
5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids Res. 28:E15) or a human cell line such as HeLa 

20 cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. Commun. 268:8-13). A particular 

embodiment of the present invention involves screening a combinatorial library of oligonucleotides 
(such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and modified 
oligonucleotides) for antisense activity against a specific polynucleotide sequence (Bruice, T.W. et 
al. (1997) U.S. Patent No. 5,686,242; Bruice, T^W. et al. (2000) U.S. Patent No. 6,022,691). 

25 Many methods for introducing vectors into cells or tissues are available and equally suitable 

for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be 
achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) 

30 Nat. Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, 
and monkeys. 

An additional embodiment of the invention relates to the administration of a composition 
35 which generally comprises an active ingredient formulated with a phannaceutically acceptable 
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excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. 
Various formulations are commonly known and are thoroughly discussed in the latest edition of 
Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may 
consist of TRICH, antibodies to TRICH, and mimetics, agonists, antagonists, or inhibitors of 
5 TRICH. 

- The compositions utilized in this invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

10 Compositions for pulmonary administration may be prepared in liquid or dry powder form. 

These compositions are generally aerosolized immediately prior to inhalation by the patient. In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of ' 
fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger 
peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar 

15 region of the lung have enabled the practical delivery of drugs such as insulin to blood circulation 
(see, e.g., Pattoh, J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of 
administration without needle injection, and obviates the need for potentially toxic penetration 
enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
20 ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising TRICH or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of 
25 the macromolecule. Alternatively, TRICH or a fragment thereof may be joined to a short cationic 
N-terminal portion from the HTV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. 
et al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
30 culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, 

monkeys, or pigs. An animal model may also be used to determine the appropriate concentration 
range and route of administration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
35 TRICH or fragments thereof, antibodies of TRICH, and agonists, antagonists or inhibitors of 
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TRICH, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, 
such as by calculating the ED 50 (the dose therapeutically effective in 50% of the population) or LD 50 
(the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is 
5 the therapeutic index, which can be expressed as the LD 50 /ED 50 ratio. Compositions which exhibit 
large therapeutic indices are preferred. The data obtained from cell culture assays and animal 
studies are used to formulate a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that includes the ED 50 with 
little or no toxicity. The dosage varies within this range depending upon the dosage form employed, 

10 the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of 
the active moiety or to maintain the desired effect. Factors which may be taken into account include 
the severity of the disease state, the general health of the subject, the age, weight, and gender of the 

15 subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 

response to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, 
or biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 fxg to 100,000 ^g, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 

20 methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

25 In another embodiment, antibodies which specifically bind TRICH may be used for the 

diagnosis of disorders characterized by expression of TRICH, or in assays to monitor patients being 
treated with TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for TRICH include methods which utilize the antibody and a label to detect 

30 TRICH in human body fluids or in extracts of cells or tissues. The antibodies may be used with or 
without modification, and may be labeled by covalent or non-covalent attachment of a reporter 
molecule. A wide variety of reporter molecules, several of which are described above, are known in 
the art and may be used. 

A variety of protocols for measuring TRICH, including ELISAs, RIAs, and FACS, are 

35 known in the art and provide a basis for diagnosing altered or abnormal levels of TRICH expression. 
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Normal or standard values for TRICH expression are established by combining body fluids or cell 
extracts taken from normal mammalian subjects, for example, human subjects, with antibodies to 
TRICH under conditions suitable for complex formation. The amount of standard complex 
formation may be quantitated by various methods, such as photometric means. Quantities of 
5 TRICH expressed in subject, control, and disease samples from biopsied tissues are compared with 
the standard values. Deviation between standard and subject values establishes the parameters for 
diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding TRICH may be used 
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide 
10 sequences, complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used 
to detect and quantify gene expression in biopsied tissues in which expression of TRICH may be 
correlated with disease. The diagnostic assay may be used to determine absence, presence, and ^ 
excess expression of TRICH, and to monitor regulation of TRICH levels during therapeutic 
intervention. 

15 In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 

sequences, including genomic sequences, encoding TRICH or closely related molecules may be 
used to identify nucleic acid sequences which encode TRICH. The specificity of the probe, whether 
it is made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, 
e.g., a conserved motif, and the stringency of the hybridization or amplification will determine 

20 whether the probe identifies only naturally occurring sequences encoding TRICH, allelic variants, 
or related sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the TRICH encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:33-64 or 
25 from genomic sequences including promoters, enhancers, and introns of the TRICH gene. 

Means for producing specific hybridization probes for DNAs encoding TRICH include the 
cloning of polynucleotide sequences encoding TRICH or TRICH derivatives into vectors for the 
production of mRNA probes. Such vectors are known" in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
30 polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding TRICH may be used for the diagnosis of disorders 
associated with expression of TRICH. Examples of such disorders include, but are not limited to, a 
35 transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic 
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fibrosis, Becker's muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, 
diabetes insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic 
paralysis, normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug 
resistance, myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, 

5 peripheral neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with 
transport, e.g., angina, bradyarrythmia, tachyarrythmia, hypertension, Long QT syndrome, 
myocarditis, cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, 
mitochondrial myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body 
myositis, infectious myositis, polymyositis, neurological disorders associated with transport, e.g., 

10 Alzheimer's disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, 
paranoid psychoses, and schizophrenia, and other disorders associated with transport, e.g., , 
neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, ^ 
Wilson's disease, cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, 
hyperglycemia, hypoglycemia, Grave's disease, goiter, Cushing's disease, Addison's disease, 

15 glucose-galactose malabsoiption syndrome, glycogen storage disease, hypercholesterolemia, 

adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn syndrome, von Gierke 
disease, pseudohypoaldosteronism type 1, Liddle's syndrome, cystinuria, iminoglycinuria, Hartup 
disease, Fanconi disease, and Bartter syndrome; a neurological disorder such as epilepsy, ischemic 
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 

20 Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 

25 disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann- 

Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 

30 nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and 
other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and 
polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic 
paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, seasonal affective 
disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, hemiplegic migraine, tardive 

35 dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
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supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a muscle 
disorder such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's muscular 
dystrophy, myotonic dystrophy, central core disease, nemaline myopathy, centronuclear myopathy, 
lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis, dermatomyositis, 
5 inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, anaphylactic shock, 
arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension, hypoglycemia, 
myocardial infarction, migraine, pheochromocytoma, and myopathies including encephalopathy, 
epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, ophthalmoplegia, acid 
maltase deficiency (AMD, also known as Pompe's disease), generalized myotonia, and myotonia 

10 congenita; an immunological disorder such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, 
amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune 
thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), 
bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 

15 diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis 
fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, 
Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple 
sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 

20 Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 

25 (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 

primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 
bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 

30 thymus, thyroid, and uterus. The polynucleotide sequences encoding TRICH may be used in 
Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR 
technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing 
fluids or tissues from patients to detect altered TRICH expression. Such qualitative or quantitative 
methods are well known in the art. 

35 a particular aspect, the nucleotide sequences encoding TRICH may be useful in assays 
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that detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding TRICH may be labeled by standard methods and added to a fluid or tissue 
sample from a patient under conditions suitable for the formation of hybridization complexes. After 
a suitable incubation period, the sample is washed and the signal is quantified and compared with a 
5 standard value. If the amount of signal in the patient sample is significantly altered in comparison 
to a control sample then the presence of altered levels of nucleotide sequences encoding TRICH in 
the sample-indicates the presence of the associated disorder. Such assays may also be used to 
evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical 
trials, or to monitor the treatment of an individual patient. 

10 In order to provide a basis for the diagnosis of a disorder associated with expression of 

TRICH, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a. ✓ 
sequence, or a fragment thereof, encoding TRICH, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 

15 normal subjects with values from an experiment in which a known amount of a substantially 
purified polynucleotide is used. Standard values obtained in this manner may be compared with 
values obtained from samples from patients who are symptomatic for a disorder. Deviation from 
standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 

20 hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained 
from successive assays may be used to show the efficacy of treatment over a period ranging from 
several days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 

25 overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the 
appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow health 
professionals to employ preventative measures or aggressive treatment earlier thereby preventing 
the development or further progression of the cancer. 

30 Additional diagnostic uses for oligonucleotides designed from the sequences encoding 

TRICH may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a 
polynucleotide encoding TRICH, or a fragment of a polynucleotide complementary to the 
polynucleotide encoding TRICH, and will be employed under optimized conditions for 

35 identification of a specific gene or condition. Oligomers may also be employed under less stringent 
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conditions for detection or quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding TRICH may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic 
5 disease in humans. Methods of SNP detection include, but are not limited to, single-stranded 
conformation polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, 
oligonucleotide primers derived from the polynucleotide sequences encoding TRICH are used to 
amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, 
from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause 

10 differences in the secondary and tertiary structures of PCR products in single-stranded form, and 
these differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the 
oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in high* 
throughput equipment such as DNA sequencing machines. Additionally, sequence database 
analysis methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by 

15 comparing the sequence of individual overlapping DNA fragments which assemble into a common 
consensus sequence. These computer-based methods filter out sequence variations due to 
laboratory preparation of DNA and sequencing errors using statistical models and automated 
analyses of DNA sequence chromatograms. In the alternative, SNPs may be detected and 
characterized by mass spectrometry using, for example, the high throughput MASSARRAY system 

20 (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of TRICH include radiolabeling 
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 

25 accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide 
of interest is presented in various dilutions and a spectrophotometric or colorimetric response gives 
rapid quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The 

30 microarray can be used in transcript imaging techniques which monitor the relative expression 
levels of large numbers of genes simultaneously as described below. The microarray may also be 
used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to . 
monitor progression/regression of disease as a function of gene expression, and to develop and 

35 monitor the activities of therapeutic agents in the treatment of disease. In particular, this 
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information may be used to develop a pharmacogenomic profile of a patient in order to select the 
most appropriate and effective treatment regimen for that patient. For example, therapeutic agents 
which are highly effective and display the fewest side effects may be selected for a patient based on 
his/her pharmacogenomic profile. 
5 In another embodiment, TRICH, fragments of TRICH, or antibodies specific for TRICH 

may be used as elements on a microarray. The microarray may be used to monitor or measure 
protein-protein interactions, drug-target interactions, and gene expression profiles, as described 
above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 

10 generate a transcript image of a tissue or cell type. A transcript image represents the global pattern 
of gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed 
by quantifying the number of expressed genes and their relative abundance under given conditions ' 
and at a given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent 
No. 5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be 

15 generated by hybridizing the polynucleotides of the present invention or their complements to the 
totality of transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, 
the hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

20 Transcript images may be generated using transcripts isolated from tissues, cell lines, 

biopsies, or other biological samples. The transcript image may thus reflect gene expression in 
vivo, as in the case of a tissue or biopsy sample, or in vitro , as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present 
invention may also be used in conjunction with in vitro model systems and preclinical evaluation of 

25 pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and 
toxicity (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson 
(2000) Toxicol. Lett. 1 12-1 13:467-471, expressly incorporated by reference herein). If a test 

30 compound has a signature similar to that of a compound with known toxicity, it is likely to share 
those toxic properties. These fingerprints or signatures are most useful and refined when they 
contain expression information from a large number of genes and gene families. Ideally, a genome- 
wide measurement of expression provides the highest quality signature. Even genes whose 
expression is not altered by any tested compounds are important as well, as the levels of expression 

35 of these genes are used to normalize the rest of the expression data. The normalization procedure is 
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useful for comparison of expression data after treatment with different compounds. While the 
assignment of gene function to elements of a toxicant signature aids in interpretation of toxicity 
mechanisms, knowledge of gene function is not necessary for the statistical matching of signatures 
which leads to prediction of toxicity. (See, for example, Press Release 00-02 from the National 
5 Institute of Environmental Health Sciences, released February 29, 2000, available at 

http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in 
toxicological screening using toxicant signatures to include all expressed gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 

10 treated biological sample are hybridized with one or more probes specific to the polynucleotides of 
the present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared ' 
with levels in an untreated biological sample. Differences in the transcript levels between -the two 
samples are indicative of a toxic response caused by the test compound in the treated sample. 

15 Another particular embodiment relates to the use of the polypeptide sequences of the 

present invention to analyze the proteome of a tissue or cell type. The term proteome refers to the 
global pattern of protein expression in a particular tissue or cell type. Each protein component of a 
proteome can be subjected individually to further analysis. Proteome expression patterns, or 
profiles, are analyzed by quantifying the number of expressed proteins and their relative abundance 

20 under given conditions and at a given time. A profile of a cell's proteome may thus be generated by 
separating and analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the 
separation is achieved using two-dimensional gel electrophoresis, in which proteins from a sample 
are separated by isoelectric focusing in the first dimension, and then according to molecular weight 
by sodium dodecyl sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, 

25 supra) . The proteins are visualized in the gel as discrete and uniquely positioned spots, typically by 
staining the gel with an agent such as Coomassie Blue or silver or fluorescent stains. The optical 
density of each protein spot is generally proportional to the level of the protein in the sample. The 
optical densities of equivalently positioned protein spots from different samples, for example, from 
biological samples either treated or untreated* with a test compound or therapeutic agent, are 

30 compared to identify any changes in protein spot density related to the treatment. The proteins in 
the spots are partially sequenced using, for example, standard methods employing chemical or 
enzymatic cleavage followed by mass spectrometry. The identity of the protein in a spot may be 
determined by comparing its partial sequence, preferably of at least 5 contiguous amino acid 
residues, to the polypeptide sequences of the present invention. In some cases, further sequence 

35 data may be obtained for definitive protein identification. 
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A proteomic profile may also be generated using antibodies specific for TRICH to quantify 
the levels of TRICH expression. In one embodiment, the antibodies are used as elements on a 
microarray, and protein expression levels are quantified by exposing the microarray to the sample 
and detecting the levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. 
5 Biochem. 270: 103-1 1 1 ; Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be 
performed by a variety of methods known in the art, for example, by reacting the proteins in the 
sample with a thiol- or ammo-reactive fluorescent compound and detecting the amount of 
fluorescence bound at each array element. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and 

10 should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor 

correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and J. Scilhamer ( 1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be' 
useful in the analysis of compounds which do not significantly affect the transcript image, but 
which alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, 

15 due to rapid degradation of niRN A, so proteomic profiling may be more reliable and informative in 
such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated 
biological sample arc separated so that the amount of each protein can be quantified. The amount of 

20 each protein is compared to the amount of the corresponding protein in an untreated biological 
sample. A difference in the amount of protein between the two samples is indicative of a toxic 
response to the test compound in the treated sample. Individual proteins are identified by 
sequencing the amino acid residues of the individual proteins and comparing these partial sequences 
to the polypeptides of the present invention. 

25 In another embodiment, the toxicity of a test compound is assessed by treating a biological 

sample containing proteins with the test compound. Proteins from the biological sample are 
incubated with antibodies specific to the polypeptides of the present invention. The amount of 
protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
sample is compared with the amount in an untreated biological sample. A difference in the amount 

30 of protein between the two samples is indicative of a toxic response to the test compound in the 
treated sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, 
e.g., Brennan, T.NL et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. 
Acad. Sci. USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116; 
35 Shalon, D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. 
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Sci. USA 94:2150-2155; and Heller, M.L et al. (1997) U.S. Patent No. 5,605,662.) Various types of 
microarrays are well known and thoroughly described in DNA Microarravs: A Practical App roach. 
M. Schena, ed. (1999) Oxford University Press, London, hereby expressly, incorporated by 
reference. 

5 , In another embodiment of the invention, nucleic acid sequences encoding TRICH may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences 
may be preferable over coding sequences. For example, conservation of a coding sequence among 
members of a multi-gene family may potentially cause undesired cross hybridization dining 

10 chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial 
chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes 
(BACs), bacterial PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, 
J.J. et al. (1997) Nat. Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B J. 

15 (1991) Trends Genet. 7:149-154.) Once mapped, the nucleic acid sequences of the invention may 
be used to develop genetic linkage maps, for example, which correlate the inheritance of a disease 
state with the inheritance of a particular chromosome region or restriction fragment length 
polymorphism (RFLP). (See, for example, Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. 
Sci. USA 83:7353-7357.) 

20 Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 

map data. (See, e.g.,'Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of 
genetic map data can be found in various scientific journals or at the Online Mendelian Inheritance 
in Man (OMIM) World Wide Web site. Correlation between the location of the gene encoding 
TRICH on a physical map and a specific disorder, or a predisposition to a specific disorder, may 

25 help define the region of DNA associated with that disorder and thus may further positional cloning 
efforts. 

tositu hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 

30 may reveal associated markers even if the exact chromosomal locus is not known. This information 
is valuable to investigators searching for disease genes using positional cloning or other gene ' 
discovery techniques. Once the gene or genes responsible for a disease or syndrome have been 
crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 
llq22-23, any sequences mapping to that area may represent associated or regulatory genes for 

35 further investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide 
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sequence of the instant invention may also be used to detect differences in the chromosomal 

location due to translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, TRICH, its catalytic or immunogenic fragments, or 

oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
5 screening techniques. The fragment employed in such screening may be free in solution, affixed to 

a solid support, borne on a cell surface, or located intracellularly. The formation of binding 

complexes between TRICH and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 

having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
10 application WO84/03564.) In this method, large numbers of different small test compounds are 

synthesized on a solid substrate. The test compounds are reacted with TRICH, or fragments thereof, 

and washed. Bound TRICH is then detected by methods well known in the art. Purified TRICH can 

also be coated directly onto plates for use in the aforementioned drug screening techniques. 

Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
15 solid support. 

In another embodiment, one may use competitive drug screening assays in which 

neutralizing antibodies capable of binding TRICH specifically compete with a test compound for 

binding TRICH. In this manner, antibodies can be used to detect the presence of any peptide which 

shares one or more antigenic determinants with TRICH. 
20 In additional embodiments, the nucleotide sequences which encode TRICH may be used in 

any molecular biology techniques that have yet to be developed, provided the new techniques rely 

on properties of nucleotide sequences that are currently known, including, but not limited to, such 

properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
25 description, utilize the present invention to its fullest extent. The following embodiments are, 

therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure 

in any way whatsoever. 

The disclosures of all patents, applications and publications, mentioned above and below, 

including U.S. Ser. No. 60/254,303, U.S. Ser. No. 60/256,190, U.S. Ser. No. 60/257,504, U.S. Ser. 
30 No. 60/261,546, U.S. Ser. No. 60/262,832, U.S. Ser. No. 60/264,377, and U.S. Ser. No. 60/266,019, 

are expressly incorporated by reference herein. 

EXAMPLES 

I. Construction of cDNA Libraries 

35 Incyte cDNAs were derived from cDNA libraries described in the LIFESEQ GOLD 
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database (Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in 
guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a suitable 
mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of phenol and 
guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted 
5 with chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate 
and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was 
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles 

10 (QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, 
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the 
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding 
cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 

15 UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using 
the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , 
units 5.1-6.6.) Reverse transcription was initiated. using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with 
the appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected 

20 (300-1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. 
cDNAs were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, 
e.g., PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), PCDNA2.1 
plasmid (Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), PCR2-TOPOTA plasmid 

25 (Invitrogen), PCMV-ICIS plasmid (Stratagene), pIGEN (Incyte Genomics, Palo Alto CA), pRARE 
(Incyte Genomics), or pINCY (Incyte Genomics), or derivatives thereof. Recombinant plasmids 
were transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or SOLR from 
Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies, 
n. Isolation of cDNA Clones 

30 Plasmids obtained as described in Example I were recovered from host cells by in vivo 

excision using-the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified 
using at least one of the following: a Magic or WIZARD Minipreps DNA purification system 
(Promega); an AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and 
QIAWELL 8 Plasmid, QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems 

35 or the R.E. A.L. PREP 96 plasmid purification kit from QIAGEN. Following precipitation, plasmids 
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were resuspended in 0.1 ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PGR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216: 1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 

5 384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 
IIL Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example H were sequenced as follows. 

10 Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal 
cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the. ✓ ' 
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such 

15 as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the 
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art. Reading 

20 frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 
1997, supra , unit 7.7). Some of the cDNA sequences were selected for extension using the 
techniques disclosed in Example VIE. 

The polynucleotide sequences derived from Incyte cDNAs were validated by removing ' 
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 

25 programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. 
The Incyte cDNA sequences or translations thereof were then queried against a selection of public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, 
and BLOCKS, PRINTS, DOMO, PRODOM; PROTEOME databases with sequences from Homo 
sapiens , Rattus norvegicus . Mus musculus , Caenorhabditis elegans , Saccharomvces cerevisiae , 

30 Schizosaccharomvces pombe . and Candida albicans (Incyte Genomics, Palo Alto CA); and hidden 
Markov model (HMM)-based protein family databases such as PFAM. (HMM is a probabilistic 
approach which analyzes consensus primary structures of gene families. See, for example, Eddy, 
S.R. (1996) Curr. Opin. Struct Biol. 6:361-365.) The queries were performed using programs 
based on BLAST, FAST A, BLIMPS, and HMMER. The Incyte cDNA sequences were assembled 

35 to produce full length polynucleotide sequences. Alternatively, GenBank cDNAs, GenBank ESTs, 
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stitched sequences, stretched sequences, or Genscan-predicted coding sequences (see Examples IV 
and V) were used to extend Incyte cDNA assemblages to full length. Assembly was performed 
using programs based on Phred, Phrap, and Consed, and cDNA assemblages were screened for opei 
reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 
5 polynucleotide sequences were translated to derive the corresponding full length polypeptide 
sequences. Alternatively, a polypeptide of the invention may begin at any of the methionine 
residues of the full length translated polypeptide. Full length polypeptide sequences were 
subsequently analyzed by querying against databases such as the GenBank protein databases 
(genpcpt), SwissProt, the PROTEOME databases, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, 
10 and hidden Markov model (HMM)-based protein family databases such as PFAM. Full length 
polynucleotide sequences are also analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide'' 
and polypeptide sequence alignments are generated using default parameters specified by the 
CLUSTAL algorithm as incorporated into the MEGALIGN multisequence alignment program 
15 (DNASTAR), which also calculates the percent identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly 
of Incyte cDNA and full length sequences and provides applicable descriptions, references, and 
threshold parameters. The first column of Table 7 shows the tools, programs, and algorithms used, 
the second column provides brief descriptions thereof, the third column presents appropriate 
20 references, all of which are incorporated by reference herein in their entirety, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate the 
strength of a match between two sequences (the higher the score or the lower the probability value, 
the greater the identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide 
25 and polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ 
ID NO:33-64. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization 
and amplification technologies are described in Table 4, column 2. 
IV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative transporters and ion channels were initially identified by running the Genscan gene 
30 identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan 
is a general-purpose gene identification program which analyzes genomic DNA sequences from a 
variety of organisms (See Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94, and Burge, C. and 
S. Karlin (1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to 
form an assembled cDNA sequence extending from a methionine to a stop codon. The output of 
35 Genscan is a FASTA database of polynucleotide and polypeptide sequences. The maximum range 
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of sequence for Genscan to analyze at once was set to 30 kb. To determine which of these Genscan 
predicted cDNA sequences encode transporters and ion channels, the encoded polypeptides were 
analyzed by querying against PFAM models for transporters and ion channels. Potential 
transporters and ion channels were also identified by homology to Incyte cDNA sequences that had 
5 been annotated as transporters and ion channels. These selected Genscan-predicted sequences were 
then compared by BLAST analysis to the genpept and gbpri public databases. Where necessary, the 
Genscan-predicted sequences were then edited by comparison to the top BLAST hit from genpept to 
correct errors in the sequence predicted by Genscan, such as extra or omitted exons. BLAST ' 
analysis was also used to find any Incyte cDNA or public cDNA coverage of the Genscan-predicted 

10 sequences, thus providing evidence for transcription. When Incyte cDNA coverage was available, 
this information was used to correct or confirm the Genscan predicted sequence! * Full length 
polynucleotide sequences were obtained by assembling Genscan-predicted coding sequences with. ✓ 
Incyte cDNA sequences and/or public cDNA sequences using the assembly process described in 
Example HI. Alternatively, full length polynucleotide sequences were derived entirely from edited 

15 or unedited Genscan-predicted coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Sequences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene 
identification program described in Example IV. Partial cDNAs assembled as described in Example 

20 HI were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an 
algorithm based on graph theory and dynamic programming to integrate cDNA and genomic 
information, generating possible splice variants that were subsequently confirmed, edited, or 
extended to create a full length sequence. Sequence intervals in which the entire length of the 

25 interval was present on more than one sequence in the cluster were identified, and intervals thus 
identified were considered to be equivalent by transitivity. For example, if an interval was present 
on a cDNA and two genomic sequences, then all three intervals were considered to be equivalent. 
This process allows unrelated but consecutive genomic sequences to be brought together, bridged by 
cDNA sequence. Intervals thus identified were then "stitched" together by the stitching algorithm 

30 in the order that they appear along their parent sequences to generate the longest possible sequence, 
as well as sequence variants. Linkages between intervals which proceed along one type of parent 
sequence (cDNA to cDNA or genomic sequence to genomic sequence) were given preference over 
linkages which change parent type (cDNA to genomic sequence). The resultant stitched sequences 
were translated and compared by BLAST analysis to the genpept and gbpri public databases. 

35 Incorrect exons predicted by Genscan were corrected by comparison to the top BLAST hit from 
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genpept. Sequences were further extended with additional cDNA sequences, or by inspection of 
genomic DNA, when necessary. 
"Stretched" Sequences 

Partial DNA sequences were extended to full length with an algorithm based on BLAST 
5 analysis. First, partial cDNAs assembled as described in Example EI were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST 
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs 

10 (HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions 
may occur in the chimeric protein with respect to the original GenBank protein homolog. The 
GenBank protein homolog, the chimeric protein, or both were used as probes to search for . ✓ 

homologous genomic sequences from the public human genome databases. Partial DNA sequences 
were therefore "stretched" or extended by the addition of homologous genomic sequences. The 

15 resultant stretched sequences were examined to determine whether it contained a complete gene. 
VI. Chromosomal Mapping of TRICH Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:33-64 were compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith- Waterman algorithm. Sequences from these databases that matched 

20 SEQ ID NO:33-64 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the 

25 assignment of all sequences of that cluster, including its particular SEQ ID NO:, to that map 
location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's 
p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies 

30 between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of 
DNA in humans, although this can vary widely due to hot and cold spots of recombination.) The 
cM distances are based on genetic markers mapped by Genethon which provide boundaries for 
radiation hybrid markers whose sequences were included in each of the clusters. Human genome 
maps and other resources available to the public, such as the NCBI "GeneMap'PQ" World Wide 

35 Web site (http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously 
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identified disease genes map within or in proximity to the intervals indicated above. 
VII. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which 
5 RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; 
Ausubel (1995) supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or 
related molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This 
analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of 

10 the computer search can be modified to determine whether any particular match is categorized as 
exact or similar. The basis of the search is the product score, which is defined as: 

• • - * - . y 

BLAST Score x Percent Identity 
5 x minimum {length(Seq. 1), length(Seq. 2)} 

15 . 
The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 

20 calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). . If there is more than one HSP, then the pair with the highest BLAST score is used to 
calculate the product score. The product score represents a balance between fractional overlap and 
quality in a BLAST alignment. For example, a product score of 100 is produced only for 100% 

25 identity over the entire length of the shorter of the two sequences being compared. A product score 
of 70 is produced either by 100% identity and 70% overlap at one end, or by 88% identity and 100% 
overlap at the other. A product score of 50 is produced either by 100% identity and 50% overlap at 
one end, or 79% identity and 100% overlap. 

Alternatively, polynucleotide sequences encoding TRICH are analyzed with respect to the 

30 tissue sources from which they were derived. For example, some full length sequences are 

assembled, at least in part, with overlapping Incyte cDNA sequences (see Example III). Each cDNA 
sequence is derived from a cDNA library constructed from a human tissue. Each human tissue is 
classified into one of the following organ/tissue categories: cardiovascular system; connective 
tissue; digestive system; embryonic structures; endocrine system; exocrine glands; genitalia, female; 

35 genitalia, male; germ cells; hemic and immune system; liver; musculoskeletal system; nervous 
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system; pancreas; respiratory system; sense organs; skin; stomatognathic system; 
unclassified/mixed; or urinary tract. The number of libraries in each category is counted and 
divided by the total number of libraries across all categories. Similarly, each human tissue is 
classified into one of the following disease/condition categories: cancer, cell line, developmental, 
5 inflammation, neurological, trauma, cardiovascular, pooled, and other, and the number of libraries 
in each category is counted and divided by the total number of libraries across all categories. The 
resulting percentages reflect the tissue- and disease-specific expression of cDNA encoding TRICH. 
cDNA sequences and cDNA library/tissue information are found in the LIFESEQ GOLD database 
(Incyte Genomics, Palo Alto CA). 

10 VIIL Extension of TRICH Encoding Polynucleotides 

Full length polynucleotide sequences were also produced by extension of an appropriate 
fragment of the full length molecule using oligonucleotide primers designed from this fragment. . ✓ 
One primer was synthesized to initiate 5' extension of the known fragment, and the other primer was 
synthesized to initiate 3' extension of the known fragment. The initial primers were designed using 

15 OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68 °C to about 72°C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 

20 extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PGR using methods well known in the art. PCR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The 
reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2+ , 
(NHi) 2 S0 4 , and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), 

25 ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the 

following parameters for primer pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; 
Step 3: 60°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 
5 min; Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+ were as 
follows: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 

30 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 °C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 fi\ PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 [il of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Huoroskan II 

35 (Labsystems Oy , Helsinki, Finland) to measure the fluorescence of the sample and to quantify the . 
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concentration of DNA. A 5 }A to 10 /il aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1 % agarose gel to determine which reactions were successful in extending the 
sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384 : well plates, 
5 digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended 
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC i8 vector 

10 (Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Straiagene) to fill-in restriction 
site overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37°C in*' 
384-well plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 

15 (Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 

parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low 
DNA recoveries were reamplified using the same conditions as described above. Samples were 

20 diluted with 20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer 
sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit(Applied Biosystems). 

In like manner, full length polynucleotide sequences are verified using the above procedure 
or are used to obtain 5'regulatory sequences using the above procedure along with oligonucleotides 

25 designed for such extension, and an appropriate genomic library. 
IX. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO.33-64 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 

30 fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 fxCi of 
[y- 32 P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmaciai Biotech). 

35 An aliquot containing 10 7 counts per minute of the labeled probe is used in a typical membrane- 
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based hybridization analysis of human genomic DNA digested with one of the following 
endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or. Pvu O (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
5 hours at 40 °C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

X. Microarrays 

10 The linkage or synthesis of array elements upon a microarray can be achieved utilizing 

photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra .V 
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the ' 
aforementioned technologies should be uniform and solid with a non-porous surface (Schena 
(1999), supra). Suggested substrates include silicon, silica, glass slides, glass chips, and silicon 

15 wafers. Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and 
link elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 
procedures. A typical array may be produced using available methods and machines well known to 
those of ordinary skilj in the art and may contain any appropriate number of elements. (See, e.g., 
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 

20 Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof 
may comprise the elements of the microarray. Fragments or oligomers suitable for hybridization 
can be selected using software well known in the art such as LASERGENE software (DNASTAR). 
The array elements are hybridized with polynucleotides in a biological sample. The polynucleotides 

25 in the biological sample are conjugated to a fluorescent label or other molecular tag for ease of 
detection. After hybridization, nonhybridized nucleotides from the biological sample are removed, 
and a fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element 

30 on the microarray may be assessed. In one embodiment, microarray preparation and usage is 
described in detail below. 
Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 

35 reverse transcribed using MMLV reverse-transcriptase, 0.05 pg//xl oligo-(dT) primer (21mer), IX 
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first strand buffer, 0.03 units//*! RNase inhibitor, 500 fM dATP, 500 fiM dGTP, 500 /iM dTTP, 40 
/zM dCTP, 40 fM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly (A) + RNA with 
GEMB RIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro 
5 transcription from non-coding yeast genomic DNA. After incubation at 37 °C for 2 hr, each reaction 
sample (one with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium 
hydroxide and incubated for 20 minutes at 85°C to the stop the reaction and degrade the RNA. 
Samples are purified using two successive CHROMA SPIN 30 gel filtration spin columns 
(CLONTECH Laboratories, Inc. (CLONTECH), Palo Alto CA) and after combining, both reaction 

10 samples are ethanol precipitated using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 
ml of 100% ethanol. The sample is then dried to completion using a SpeedVAC (Savant 
Instruments Inc., 1 lolbntok NY ) and resuspended in 14 fil 5X SSC/0.2% SDS. 
Microarrav Pre p j rat ion 

Sequences of the present invention are used to generate array elements. Each array element 

15 is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification 
uses primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
fig. Amplified array elements arc then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 

20 Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

slides (Corning) arc cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, 
and coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 

25 110°Coven. 

Array elements arc applied to the coated glass substrate using a procedure described in U.S. 
Patent No. 5,807,522, incorporated herein by reference. 1 /il of the anay element DNA, at an 
average concentration of 100 ng//d, is loaded into the open capillary printing element by a high- 
speed robotic apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

30 Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60°C followed by washes in 
0.2% SDS and distilled water as before. 

35 Hybridization 
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Hybridization reactions contain 9 /il of sample mixture consisting of 0.2 fig each of Cy3 and 
Cy5 .labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65° C for 5 minutes and is aliquoted onto the microarray surface and covered 
with an 1.8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just 
5 slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the 
addition of 140 /xl of 5X SSC in a corner of the chamber. The chamber containing the arrays is 
incubated for about 6.5 hours at 60° C. The arrays are washed for 10 min at 45° C in a first wash 
buffer (IX SSC, 0.1% SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X 
SSG), and dried. 
10 Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral ' 
lines at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light 
is focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 

15 containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with 
a resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 

20 Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. 

Appropriate filters positioned between the array and the photomultiplier tubes are used to filter the 
signals. The emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. 
Each array is typically scanned twice, one scan per fluorophore using the appropriate filters at the 
laser source, although the apparatus is capable of recording the spectra from both fluorophores 

25 simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location 
on the aiTay contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two 

30 samples from different sources (e.g., representing test and control cells), each labeled with a 

different fluorophore, are hybridized to a single array for the purpose of identifying genes that are 
differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with 
the two fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to- 

35 digital (A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM- 
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compatible PC computer. The digitized data are displayed as an image where the signal intensity is 
mapped using a linear 20-color transformation to a pseudocolor scale ranging from blue (low 
signal) to red (high signal). The data is also analyzed quantitatively. Where two different 
fluorophores are excited and measured simultaneously, the data are first corrected for optical 
5 crosstalk (due to overlapping emission spectra) between the fluorophores using each fluorophore's 
emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each 
spot is centered in each element of the grid. The fluorescence signal within each element is then 
integrated to obtain a numerical value corresponding to the average intensity of the signal. The 
10 software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

XI. Complementary Polynucleotides 

Sequences complementary to the TRICH-encoding sequences, or any parts thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring TRICH. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 

15 procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of TRICH. 
To inhibit transcription, a complementary oligonucleotide is designed from the most unique 5 s 
sequence and used to prevent promoter binding to the coding sequence. To inhibit translation, a 
complementary oligonucleotide is designed to prevent ribosomal binding to the TRICH-encoding 

20 transcript. 

XII. Expression of TRICH 

Expression and purification of TRICH is achieved using bacterial or virus-based expression 
systems. For expression of TRICH in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 

25 transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express TRICH upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of TRICH in eukaryotic cells is achieved by infecting 

30 insect or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis 
virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of 
baculovirus is replaced with cDNA encoding TRICH by either homologous recombination or 
bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is 
maintained and the strong polyhedrin promoter drives high levels of cDNA transcription. 

35 Recombinant baculovirus is used to infect Spodoptera frugjperda (Sf9) insect cells in most cases, or 
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washed under conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/TRICH binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, 
such as urea or thiocyanate ion), and TRICH is collected. 
5 XVI. Identification of Molecules Which Interact with TRICH 

Molecules which interact with TRICH may include transporter substrates, agonists or 
antagonists, modulatory proteins such as Gffy proteins (Reimann, supra) or proteins involved in 
TRICH localization or clustering such as MAGUKs (Craven, supra) . TRICH, or biologically active 
fragments thereof, are labeled with 125 I Bolton-Hunter reagent. (See, e.g., Bolton A.E. and W.M. 
10 Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules previously arrayed in the wells of a 
multi-well plate are incubated with the labeled TRICH, washed, and any wells with labeled TRICH 
complex are assayed. Data obtained using different concentrations of TRICH are used to calculate 
values for the number, affinity, and association of TRICH with the candidate molecules. 

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system 
15 (Fields, S. and O. Song (1989) Nature 340:245-246). TRICH, or fragments thereof, are expressed as 
fusion proteins with the DNA binding domain of Gal4 or lexA, and potential interacting proteins are 
expressed as fusion proteins with an activation domain. Interactions between the TRICH fusion 
protein and the TRICH interacting proteins (fusion proteins with an activation domain) reconstitute 
a transactivation function that is observed by expression of a reporter gene. Yeast 2-hybrid systems 
20 are commercially available, and methods for use of the yeast 2-hybrid system with ion channel 
proteins are discussed in Niethammer, M. and M. Sheng (1998, Meth. Enzymol. 293:104-122). 

TRICH may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all 
interactions between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. 
25 (2000) U.S. Patent No. 6,057,101). 

Potential TRICH agonists or antagonists may be tested for activation or inhibition of 
TRICH ion channel activity using the assays described in section XVm. 
XVDL Demonstration of TRICH Activity 

Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion 
30 conductance. TRICH can be expressed by transforming a mammalian cell line such as COS7, HeLa 
or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic expression vectors are 
commercially available, and the techniques to introduce them into cells are well known to those 
skilled in the art. A second plasmid which expresses any one of a number of marker genes, such as 
6-galactosidase, is co-transformed into the cells to allow rapid identification of those cells which 
35 have taken up and expressed the foreign DNA. The cells are incubated for 48-72 hours after 
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transformation under conditions appropriate for the cell line to allow expression and accumulation 
of TRICH and B-galactosidase. 

Transformed cells expressing B-galactosidase are stained blue when a suitable colorimetric 
substrate is added to the culture media under conditions that are' well known in the art. Stained cells 
5 are tested for differences in membrane conductance by electrophysiological techniques that are well 
known in the art. Untransformed cells, and/or cells transformed with either vector sequences alone 
or B-galactosidase sequences alone, are used as controls and tested in parallel. Cells expressing 
TRICH will have higher anion or cation conductance relative to control cells. The contribution of 
TRICH to conductance can be confirmed by incubating the cells using antibodies specific for 

10 TRICH. The antibodies will bind to the extracellular side of TRICH, thereby blocking the pore in 
the ion channel, and the associated conductance. 

Alternatively, ion channel activity of TRICH is measured as current flow across a TRICH- > 
containing Xenopus laevis oocyte membrane using the two-electrode voltage-clamp technique (Ishi 
et al., supra ; Jegla, T. and L. Salkoff (1997) J. Neurosci. 17:32-44). TRICH is subcloned into an 

15 appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is injected into 
mature stage IV oocytes. Injected oocytes are incubated at 18 °C for 1-5 days. Inside-out 
macropatches are excised into an intracellular solution containing 116 mM K-gluconate, 4 mM KC1, 
and 10 mM Hepes (pH 7.2), The intracellular solution is supplemented with varying concentrations 
of the TRICH mediator, such as cAMP, cGMP, or Ca +2 (in the form of CaCy, where appropriate. 

20 Electrode resistance is set at 2-5 Mfi and electrodes are filled with the intracellular solution lacking 
mediator. Experiments are performed at room temperature from a holding potential of 0 mV. 
Voltage ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz. 
Current measured is proportional to the activity of TRICH in the assay. In particular, the activity of 
TRICH-1 is measured as Na + conductance and the activity of TRICH-3 is measured as Ca 2+ 

25 conductance. 

Transport activity of TRICH is assayed by measuring uptake of labeled substrates into 
Xenopus laevis oocytes. Oocytes at stages V and VI are injected with TRICH mRNA (10 ng per 
oocyte) and incubated for 3 days at 18°C in OR2 medium (82.5mM NaCl, 2.5 mM KC1, ImM CaCl 
2 , ImM MgCl 2 , ImM Na 2 HP0 4 , 5 mM Hepes, 3.8 mM NaOH , 50jug/ml gentamycin, pH 7.8) to 

30 allow expression of TRICH. Oocytes are then transferred to standard uptake medium (lOOmM 
NaCl, 2 mM KC1, ImM CaCl 2 , ImM MgCl 2 , 10 mM Hepes/Tris pH 7.5). Uptake of various 
substrates (e.g., amino acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding 
labeled substrate (e.g. radiolabeled with 3 H, fluorescently labeled with rhodamine, etc.) to the 
oocytes. After incubating for 30 minutes, uptake is terminated by washing the oocytes three times 

35 in NaMree medium, measuring the incorporated label, and comparing with controls. TRICH 
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activity is proportional to the level of internalized labeled substrate. Test substrates include ran- 
GTP for TRICH- 2, glucose for TRICH-5, amino acids for TRICH-6 and TRICH-14, cations for 
TRICH-7 and TRICH-16, Na + , K + and CI" ions for TRICH-15, reduced folate or analogues such as 
methotrexate for TRICH-17, divalent cations for TRICH-18, anions such as arsenate and antimonite 
5 for TRICH-1 9, and nitrate or oligopeptides for TRICH-20. 

ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled 
ATP-[y- 32 P], separation of the hydrolysis products by chromatographic methods, and quantitation of 
the recovered 32 P using a scintillation counter. The reaction mixture contains ATP-[y- 32 P] and 
varying amounts of TRICH in a suitable buffer incubated at 37°C for a suitable period of time. The 

10 reaction is terminated by acid precipitation with trichloroacetic acid and then neutralized with base, 
and an aliquot of the reaction mixture is subjected to membrane or filter paper-based 
chromatography to separate the reaction products. The amount of 32 P liberated is counted in a . ✓ 
scintillation counter. The amount of radioactivity recovered is proportional to the ATPase activity 
of TRICH in the assay. 

15 Lipocalin activity of TRICH is measured by ligand fluorescence enhancement 

spectrofluorometry (Lin et al. (1997) Molecular Vision 3: 17). Examples of ligands include retinol 
(Sigma, St. Louis MO) and 16-anthryloxy-palmitic acid (16-AP) (Molecular Probes Inc., Eugene 
OR). Ligand is dissolved in 100% ethanol and its concentration is estimated using known 
extinction coefficents (retinol: 46,000 A/M/cm at 325 nm; 16-AP: 8,200 A/M/cm at 361 nm). A 

20 700 ill aliquot of 1 /zM TRICH in 10 mM Tris (pH 7.5), 2 mM EDTA, and 500 mM NaCl is placed 
in a 1 cm path length quartz cuvette and 1 fil aliquots of ligand solution are added. Fluorescence is 
measured 100 seconds after each addition until readings are stable. Change in fluorescence per unit 
change in ligand concentration is proportional to TRICH activity. 
XVTII. Identification of TRICH Agonists and Antagonists 

25 TRICH is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or 

HEK (Human Embryonic Kidney) 293. Ion channel activity of the transformed cells is measured in 
the presence and absence of candidate agonists or antagonists. Ion channel activity is assayed using 
patch clamp methods well known in the art or as described in Example XVII. Alternatively, ion 
channel activity is assayed using fluorescent techniques that measure ion flux across the cell 

30 membrane (Velicelebi, G. et al. (1999) Meth. Enzymol. 294:20-47; West, M.R. and C.R. Molloy 
(1996) Anal. Biochem. 241:51-58). These assays may be adapted for high-throughput screening 
using microplates. Changes in internal ion concentration are measured using fluorescent dyes such 
as the Ca 2+ indicator Fluo-4 AM, sodium-sensitive dyes such as SBFI and sodium green, or the Cl" 
indicator MQAE (all available from Molecular Probes) in combination with the FLIPR fluorimetric 

35 plate reading system (Molecular Devices). In a more generic version of this assay, changes in 
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membrane potential caused by ionic flux across the plasma membrane are measured using oxonyl 
dyes such as DiBAC 4 (Molecular Probes). DiBAC 4 equilibrates between the extracellular solution 
and cellular sites according to the cellular membrane potential. The dye's fluorescence intensity is 
20-fold greater when bound to hydrophobic intracellular sites, allowing detection of DiBAC 4 entry 

5 into the cell (Gonzalez, J.E. and P.A. Negulescu (1998) Curr. Opin. Biotechnol. 9:624-631). 

Candidate agonists or antagonists may be selected from known ion channel agonists or antagonists, 
peptide libraries, or combinatorial chemical libraries. 

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the 

10 invention. Although the invention has been described in connection with certain embodiments, it 
should be understood that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out the invention*' 
which are obvious to those skilled in molecular biology or related fields are intended to be within 
the scope of the following claims. 
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What is claimed is: 

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32, 

c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and 

d) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32. 

2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-32. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:33-64. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method of producing a polypeptide of claim .1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein 
said cell is transformed with a recombinant polynucleotide, and said recombinant 
polynucleotide comprises a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide of claim 1, and 
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10. A method of claim 9, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32. 

5 

1 1. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group 
10 consisting of SEQ ID NO:33-64, 

b) . a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of ' 
SEQE)NO:33-64, 

c) a polynucleotide complementary to a polynucleotide of a), 

15 d) a polynucleotide complementary to a polynucleotide of b), and 

e) an RNA equivalent of a)-d). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

20 

14. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, 

25 and which probe specifically hybridizes to said target polynucleotide, under 

conditions whereby a hybridization complex is formed between said probe and said 
target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

30 

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 



16. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 
35 a) amplifying said target polynucleotide or fragment thereof using polymerase chain 

133 



V3DOCIO: <WO 024641 5A2J_> 



WO 02/46415 



PCT/US01/46963 



reaction amplification, and 
b) detecting the presence or absence of said amplified target polynucleotide or 
fragment thereof, and, optionally, if present, the amount thereof. 

17. A composition comprising a polypeptide of claim 1 and a pharmaceutically acceptable 
excipient. 

18. A composition of claim 17, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32. 

19. A method for treating a disease or condition associated with decreased expression of 
functional TR1CH. comprising administering to a patient in need of such treatment the - 
composition of claim 17. 

20. A method of screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) delecting agonist activity in the sample. 

21. A composition comprising an agonist compound identified by a method of claim 20 and 
a pharmaceutically acceptable excipient. 

22. A method for treating a disease or condition associated with decreased expression of 
functional TR1CH, comprising administering to a patient in need of such treatment a 
composition of claim 21. 

23. A method of screening a compound for effectiveness as an antagonist of a polypeptide 
of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

24. A composition comprising an antagonist compound identified by a method of claim 23 
and a pharmaceutically acceptable excipient. 

25. A method for treating a disease or condition associated with overexpression of 
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functional TRICH, comprising administering to a patient in need of such treatment a 
composition of claim 24. 

26. A method of screening for a compound that specifically binds to the polypeptide of 
5 claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under 
suitable conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby 
identifying a compound that specifically binds to the polypeptide of claim 1. 

10 

27. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, the method comprising: ^ 
a) combining the polypeptide of claim 1 with at least one test compound under 

conditions permissive for the activity of the polypeptide of claim 1, 
15 b) assessing the activity of the polypeptide of claim 1 in the presence of the test 

compound, and 

c) comparing the activity of the polypeptide of claim Tin the presence of the test 
compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 

20 presence of the test compound is indicative of a compound that modulates the 

activity of the polypeptide of claim 1. 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the 

25 method comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under 
conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying 
30 amounts of the compound and in the absence of the compound 

29. A method of assessing toxicity of a test compound, the method comprising: 

a ) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe 

35 comprising at least 20 contiguous nucleotides of a polynucleotide of claim 12 under 
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conditions whereby a specific hybridization complex is formed between said probe 
and a target polynucleotide in the biological sample, said target polynucleotide 
comprising a polynucleotide sequence of a polynucleotide of claim 12 or fragment 
thereof, 

c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample 
with the amount of hybridization complex in an untreated biological sample, 
wherein a difference in the amount of hybridization complex in the treated 
biological sample is indicative of toxicity of the test compound. 



30. A diagnostic test for a condition or disease associated with the expression of TRICH in 
a biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 1 1 , under conditions 
suitable for the antibody to bind the polypeptide and form an antibody:polypeptide 

15 complex, and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 

31. The antibody of claim 11, wherein the antibody is: 
20 a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab') 2 fragment, or 

e) a humanized antibody. 



32. A composition comprising an antibody of claim 1 1 and an acceptable excipient. 



33. A method of diagnosing a condition or disease associated with the expression of 
TRICH in a subject, comprising administering to said subject an effective amount of the 

30 composition of claim 32. 

34. A composition of claim 32, wherein the antibody is labeled. 

35. A method of diagnosing a condition or disease associated with the expression of 

35 TRICH in a subject, comprising administering to said subject an effective amount of the 
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composition of claim 34. 

36. A method of preparing a polyclonal antibody with the specificity of the antibody of 
claim 1 1 , the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, or an immunogenic 
fragment thereof, under conditions to elicit an antibody response, 

b) isolating antibodies from said animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a 
polyclonal antibody which binds specifically to a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32. 

37. A polyclonal antibody produced by a method of claim 36. 

38. A composition comprising the polyclonal antibody of claim 37 and a suitable carrier. 

39. A method of making a monoclonal antibody with the specificity of the antibody of 
claim 1 1 , the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, or an immunogenic 
fragment thereof, under conditions to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
antibody-producing hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating from the culture monoclonal antibody which binds specifically to a 
polypeptide comprising an amino acid sequence selected from the group consisting 
ofSEQIDNO:l-32. 

40. A monoclonal antibody produced by a method of claim 39. 

41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 



42. The antibody of claim 1 1, wherein the antibody is produced by screening a Fab 
expression library. 

137 



WO 02/46415 



POYUS01/46963 



43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

44. A method of detecting a polypeptide comprising an amino acid sequence selected from 
5 the group consisting of SEQ ID NO: 1-32 in a sample, the method comprising: 

a) incubating the antibody of claim 1 1 with a sample under conditions to allow 
specific binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide comprising an amino acid sequence selected from the group consisting 

10 of SEQ ID NO: 1-32 in the sample. 

45. A method of purifying a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-32 from a sample, the method comprising: 

a) incubating the antibody of claim 11 with a sample under conditions to allow 
15 specific binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32. 



20 46 - A microarray wherein at least one element of the microarray is a polynucleotide of 

claim 13. 

47. A method of generating an expression profile of a sample which contains 
polynucleotides, the method comprising: 

25 a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microarray of claim 46 with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a 
hybridization complex, and 

c) quantifying the expression of the polynucleotides in the sample. 

30 

48. An array comprising different nucleotide molecules affixed in distinct physical 
locations on a solid substrate, wherein at least one of said nucleotide molecules comprises a 
first oligonucleotide or polynucleotide sequence specifically hybridizable with at least 30 

contiguous nucleotides of a target polynucleotide, and wherein said target polynucleotide is 
35 a polynucleotide of claim 12. 
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49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target 
polynucleotide. 

5 50. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 

completely complementary to at least 60 contiguous nucleotides of said target 
polynucleotide. 



51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
10 completely complementary to said target polynucleotide. 

52. An array of claim 48, which is a microarray. 

53. An array of claim 48, further comprising said target polynucleotide hybridized to a 
15 nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 



20 55. An array of claim 48, wherein each distinct physical location on the substrate contains 

multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct 
physical location have the same sequence, and each distinct physical location on the 
substrate contains nucleotide molecules having a sequence which differs from the sequence 
of nucleotide molecules at another distinct physical location on the substrate. 

'25 

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 1. 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 
30 58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

35 
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61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:8. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 10. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 1 1. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 12. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 13. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 14. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 15. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 16. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 17. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 18. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 19. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

78. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 
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79. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 

80. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

81 . A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

82. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27. 

83. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

84. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 

85. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30. 
15 86. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:31. 

87. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:32. 

88. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
20 NO:33. 

89. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:34. 

25 90. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:35. 

91. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:36. 

30 

92. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:37. 

93. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
35 NO:38. 
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94. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:39. 

95. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
5 NO:40. 

96. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:41. 

10 97. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:42. 

98. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:43. 

15 

99. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:44. 

100. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID' 
20 NO:45. 

101. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:46. 

25 102. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:47. 

103. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:4S. 

30 

104. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:49. 

105. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
35 NO:50. 
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106. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:51. 

107. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
5 NO:52. 

108. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:53. 

10 109. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:54. 

1 10. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:55. 

15 

1 1 1. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:56. 

1 12. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
20 NO:57. 

113. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:58. 

25 114. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:59. 

1 15. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:60. 

30 

116. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:61. 

1 17. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
35 NO:62. 
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1 18. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:63. 

1 19. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
5 NO:64. 
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TRANSPORTERS AND ION CHANNELS 

This invention relates to nucleic acid and amino acid sequences of transporters and ion 
channels and to the use of these sequences in the diagnosis, treatment, and prevention of transport, 
5 neurological, muscle, immunological and cell proliferative disorders, and in the assessment of the 
effects of exogenous compounds on the expression of nucleic acid and amino acid sequences of 
transporters and ion channels. 

BACKGROUND OF THE INVENTION 

10 Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by 

hydrophobic lipid bilayer membranes which are highly impermeable to most polar molecules. Cells 
and organelles require transport proteins to import and export essential nutrients and metal ions ' 
including K*, NH 4 + , ? iy S0 4 2 \ sugars, and vitamins, as well as various metabolic waste products. 
Transport proteins also play roles in antibiotic resistance, toxin secretion, ion balance, synaptic 

15 neurotransmission, kidney function, intestinal absorption, tumor growth, and other diverse cell 
functions (Griffith, J. and C Sansom (199S) The Transporter Facts Book . Academic Press, San 
Diego CA, pp. 3-29). Transport can occur by a passive concentration-dependent mechanism, or can 
be linked to an energy source such as ATP hydrolysis or an ion gradient. Proteins that function in 
transport include carrier proteins, which bind to a specific solute and undergo a conformational 

20 change that translocates the bound solute across the membrane, and channel proteins, which form 
hydrophilic pores that allow specific solutes to diffuse through the membrane down an 
electrochemical solute gradient. 

Carrier proteins which transport a single solute from one side of the membrane to the other 
are called uniporters. In contrast, coupled transporters link the transfer of one solute with 

25 simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in 
the opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety 
of symporter systems driven by the sodium gradient that exists across the plasma membrane. 
Sodium moves into the cell down its electrochemical gradient and brings the solute into the cell 
with it. The sodium gradient that provides the driving force for solute uptake is maintained by the 

30 ubiquitous Na"7K + ATPase system. Sodium-coupled transporters include the mammalian glucose 
transporter (SGLT1), iodide transporter (NIS), and multivitamin transporter (SMVT). All three 
transporters have twelve putative transmembrane segments, extracellular glycosylation sites, and 
cytoplasmically-oriented N- and C-termini. NIS plays a crucial role in the evaluation, diagnosis, 
and treatment of various thyroid pathologies because it is the molecular basis for radioiodide 

35 thyroid-imaging techniques and for specific targeting of radioisotopes to the thyroid gland (Levy, 
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O. et al. (1997) Proc. Natl. Acad. ScL USA 94:5568-5573). SMVT is expressed in the intestinal 
mucosa, kidney, and placenta, and is implicated in the transport of the water-soluble vitamins, e.g., 
biotin and pantothenate (Prasad, P.D. et al. (1998) J. Biol. Chem. 273:7501-7506). 

One of the largest families of transporters is the major facilitator superfamily (MFS), also 

5 called the uniporter-symporter-antiporter family. MFS transporters are single polypeptide carriers 
that transport small solutes in response to ion gradients. Members of the MFS are found in all 
classes of living organisms, and include transporters for sugars, oligosaccharides, phosphates, 
nitrates, nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaryotes all have 
a structure comprising 12 transmembrane segments (Pao, S.S. et al. (1998) Microbiol. Molec. Biol. 

0 Rev. 62: 1-34). The largest family of MFS transporters is the sugar transporter family, which 

includes the seven glucose transporters (GLUT1-GLUT7) found in humans that are required for the 
transport of glucose and other hexose sugars. These glucose transport proteins have unique tissue 
distributions and physiological functions. GLUT1 provides many cell types with their basal glucose 
requirements and transports glucose across epithelial and endothelial barrier tissues; GLUT2 

5 facilitates glucose uptake or efflux from the liver; GLUT3 regulates glucose supply to neurons; 
GLUT4 is responsible for insulin-regulated glucose disposal; and GLUT5 regulates fructose uptake 
into skeletal muscle. Defects in glucose transporters are involved in a recently identified 
neurological syndrome causing infantile seizures and developmental delay, as well as glycogen 
storage disease, Fanconi-Bickel syndrome, and non-insulin-dependent diabetes mellitus (Mueckler, 

0 M. (1994) Eur. J. Biochem. 219:713-725; Longo, N. and L.J. Elsas (1998) Adv. Pediatr. 45:293- 
313). 

Monocarboxylate anion transporters are proton-coupled symporters with a broad substrate 
specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and 
beta-hydroxybutyrate. At least seven isoforms have been identified to date. The isoforms are 

5 predicted to have twelve transmembrane (TM) helical domains with a large intracellular loop 
between TM6 and TM7, and play a critical role in maintaining intracellular pH by removing the 
protons that are produced stoichiometrically with lactate during glycolysis. The best characterized 
H + -rnonocarboxylate transporter is that of the erythrocyte membrane, which transports L-lactate and 
a wide range of other aliphatic monocarboxylates. Other cells possess HMinked monocarboxylate 

0 transporters with differing substrate and inhibitor selectivities. In particular, cardiac muscle and 
tumor cells have transporters that differ in their values for certain substrates, including 
stereoselectivity for L- over D-lactate, and in their sensitivity to inhibitors. There are 
Na + -monocarboxylate cotransporters on the luminal surface of intestinal and kidney epithelia, which 
allow the uptake of lactate, pyruvate, and ketone bodies in these tissues. In addition, there are 

5 specific and selective transporters for organic cations and organic anions in organs including the 
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kidney, intestine and liver. Organic anion transporters are selective for hydrophobic, charged 
molecules with electron-attracting side groups. Organic cation transporters, such as the ammonium 
transporter, mediate the secretion of a variety of drugs and endogenous metabolites, and contribute 
to the maintenance of intercellular pH (Poole, R.C. and A.P. Halestrap (1993) Am J. Physiol. 
5 264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 329:321-328; and Martinelle, K. and I. 
Haggstrom (1993) J. Biotechnol. 30:339-350). 

ATP-binding cassette (ABC) transporters are members of a superfamily of membrane 
proteins that transport substances ranging from small molecules such as ions, sugars, amino acids, 
peptides, and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC 

10 transporters consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze 
ATP to supply the energy required for transport, and two membrane-spanning domains (MSD), each 
containing six putative transmembrane segments. These four modules may be encoded by a single^ 
gene, as is the case for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. 
When encoded by separate genes, each gene product contains a single NBD and MSD. These "half- 

15 molecules" form homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based 
major histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to 
defects in ABC transporters, such as the following diseases and their corresponding proteins: cystic 
fibrosis (CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), 
Zellweger syndrome (peroxisomal membrane protein-70, PMP.70), and hyperinsulinemic 

20 hypoglycemia (sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) 
protein, another ABC transporter, in human cancer cells makes the cells resistant to a variety of 
cytotoxic drugs used in chemotherapy (Taglicht, D. and S. Michaelis (1998) Meth. Enzymol. 
292:130-162). 

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, 
25 selenium, nickel, and chromium are important as cofactors for a number of enzymes. For example, 
copper is involved in hemoglobin synthesis, connective tissue metabolism, and bone development, 
by acting as a cofactor in oxidoreductases such as superoxide dismutase, ferroxidase 
(ceruloplasmin), and lysyl oxidase. Copper and other metal ions must be provided in the diet, and 
are absorbed by transporters in the gastrointestinal tract. Plasma proteins transport the metal ions to 
30 the liver and other target organs, where specific transporters move the ions into cells and cellular 
organelles as needed. Imbalances in metal ion metabolism have been associated with a number of 
disease states (Danks, D.M. (1986) J. Med. Genet. 23:99-106). 

P-type ATPases comprise a class of cation-transporting transmembrane proteins. They are 
integral membrane proteins which use an aspartyl phosphate intermediate to move cations across a 
35 membrane. Features of P-type ATPases include: (i) a cation channel; (ii) a stalk, formed by 
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extensions of the transmembrane a-helices into the cytoplasm; (iii) an ATP binding domain; (iv) a 
phosphorylated aspartic acid; (v) an adjacent transduction domain; (vi) a phosphatase domain, 
which removes the phosphate from the aspartic acid as part of the reaction cycle; and (vii) six or 
more transmembrane domains. Included in this class are heavy metal-transporting ATPases as well 
5 as aminophospholipid transporters. 

The transport of phosphatidylserine and phosphatidylethanolamine by aminophospholipid 
translocase results in the movement of these molecules from one side of a bilayer to another. This 
transport is conducted by a newly identified subfamily of P-type ATPases which are proposed to be 
amphipath transporters. Amphipath transporters move molecules having both a hydropbilic and a 
10 hydrophobic region. As many as seventeen different genes belong to this P-type ATPases 

subfamily, being grouped into several distinct classes and subclasses (Halleck, M.S. et al., (1999) 
Physiol. Genomics 1:139-150; Vulpe.C. Et al., (1993) Nat. Genet. 3:7-13). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high 
capacity, low affinity process. However, under normal physiological conditions a significant 
15 fraction of fatty acid transport appears to occur via a high affinity, low capacity protein-mediated 
transport process. Fatty acid transport protein (FATP), an integral membrane protein with four 
transmembrane segments, is expressed in tissues exhibiting high levels of plasma membrane fatty 
acid flux, such as muscle, heart, and adipose. Expression of FATP is upregulated in 3T3-L1 cells 
during adipose conversion, and expression in COS7 fibroblasts elevates uptake of long-chain fatty 
20 acids (Hui, T.Y. et al. (1998) J. Biol. Chem. 273:27420-27429). 

The Iipocalin superfamily constitutes a phylogenetically conserved group of more than forty 
proteins that function as extracellular ligand-binding proteins which bind and transport small 
hydrophobic molecules. Members of this family function as carriers of retinoids, odorants, 
chromophores, pheromones, allergens, and sterols, and in a variety of processes including nutrient 
25 transport, cell growth regulation, immune response, and prostaglandin synthesis. A subset of these 
proteins may be multifunctional, serving as either a biosynthetic enzyme or as a specific enzyme 
inhibitor. (Tanaka, T. et al. (1997) J. Biol. Chem. 272: 15789-15795; and van't Hof, W. et al. (1997) 
J. Biol. Chem. 272: 1837-1 S41.) 

Members of the Iipocalin family display unusually low levels of overall sequence 
30 conservation. Pairwise sequence identity often falls below 20%. Sequence similarity between 
family members is limited to conserved cysteines which form disulfide bonds and three motifs 
which form a juxtaposed cluster that functions as a target cell recognition site. The lipocalins share 
an eight stranded, anti-parallel beta-sheet which folds back on itself to form a continuously 
hydrogen-bonded beta-barrel. The pocket formed by the barrel functions as an internal ligand 
35 binding site. Seven loops (LI to L7) form short beta-hairpins, except loop LI which is a large 
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omega loop that forms a lid to partially close the internal ligand-binding site (Flower (1996) 
Biochem. J. 318:1-14). 

Lipocalins are important transport molecules. Each lipocalin associates with a particular 
ligand and delivers that ligand to appropriate target sites within the organism. Retinol-binding 
5 protein (RBP), one of the best characterized lipocalins, transports retinol from stores within the liver 
to target tissues. Apolipoprotein D (apo D), a component of high density lipoproteins (HDLs) and 
low density lipoproteins (LDLs), functions in the targeted collection and delivery of cholesterol 
throughout the body. Lipocalins are also involved in cell regulatory processes. Apo D, which is 
identical to gross-cystic-disease-fluid protein (GCDFP)-24, is a progesterone/pregnenolone-binding 

10 protein expressed at high levels in breast cyst fluid. Secretion of apo D in certain human breast 
cancer cell lines is accompanied by reduced cell proliferation and progression of cells to a more 
differentiated phenotype. Similarly, apo D and another lipocalin, a r acid glycoprotein (AGP), are y 
involved in nerve cell regeneration. AGP is also involved in anti-inflammatory and 
immunosuppressive activities. AGP is one of the positive acute-phase proteins (APP); circulating 

15 levels of AGP increase in response to stress and inflammatory stimulation. AGP accumulates at 
sites of inflammation where it inhibits platelet and neutrophil activation and inhibits phagocytosis. 
The immunomodulatory properties of AGP are due to glycosylation. AGP is 40% carbohydrate, 
making it unusually acidic and soluble. The glycosylation pattern of AGP changes during acute- 
phase response, and deglycosylated AGP has no immunosuppressive activity (Hower (1994) FEBS 

20 Lett. 354:7-1 1; Flower (1996) supra). 

The lipocalin superfamily also includes several animal allergens, including the mouse major 
urinary protein (mMUP), the rat a-2-microgloobulin (rA2U), the bovine P-lactoglobulin (pig), the 
cockroach allergen (Bla g4), bovine dander allergen (Bos d2), and the major horse allergen, 
designated Eguus caballus allergen 1 (Equ cl). Equ cl is a powerful allergen responsible for about 

25 80% of anti-horse IgE antibody response in patients who are chronically exposed to horse allergens. 
It appears that lipocalins may contain a common structure that is able to induce the IgE response 
(Gregoire, C. et al„ (1996) J. Biol. Chem. 271:32951-32959). 

Lipocalins are used as diagnostic and prognostic markers in a variety of disease states. The 
plasma level of AGP is monitored during pregnancy and in diagnosis and prognosis of conditions 

30 including cancer chemotherapy, renal disfunction, myocardial infarction, arthritis, and multiple 
sclerosis. RBP is used clinically as a marker of tubular reabsorption in the kidney, and apo D is a 
marker in gross cystic breast disease (Flower (1996) supra). Additionally, the use of lipocalin 
animal allergens may help in the diagnosis of allergic reactions to horses (Gregoire supra) , pigs, 
cockroaches, mice and rats. 

35 Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions 

5 



OOCID: <WO 0246415A2_IA> 



WO 02/46415 PCT/US01/46963 

and charged metabolites between the cytosol and the mitochondrial matrix. Examples include the 
ADP, ATP carrier protein; the 2-oxoglutarate/malate carrier; the phosphate carrier protein; the 
pyruvate carrier; the dicarboxylate carrier which transports malate, succinate, fumarate, and 
phosphate; the tricarboxylate carrier which transports citrate and malate; and the Grave's disease 
5 carrier protein, a protein recognized by IgG in patients with active Grave's disease, an autoimmune 
disorder resulting in hyperthyroidism. Proteins in this family consist of three tandem repeats of an 
approximately 100 amino acid domain, each of which contains two transmembrane regions (Stryer, 
L. (1995) Biochemistry, W.H. Freeman and Company, New York NY, p. 551; PROSUE 
PDOC00189 Mitochondrial energy transfer proteins signature; Online Mendelian Inheritance in 

10 Man (OMDvl) *275000 Graves Disease). 

This class of transporters also includes the mitochondrial uncoupling proteins, which create 
proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation' 
from ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling 
proteins have been implicated as modulators of thermoregulation and metabolic rate, and have been 

15 proposed as potential targets for drugs against metabobc diseases such as obesity (Ricquier, D. et al. 
(1999) J. Int. Med. 245:637-642). 
Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement 
of ions across the plasma membrane. The movement of ions requires ion channels, which form ion- 

20 selective pores within the membrane. There are two basic types of ion channels, ion transporters 
and gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively 
transport an ion against the ion's concentration gradient. Gated ion channels allow passive flow of 
an ion down the ion's electrochemical gradient under restricted conditions. Together, these types of 
ion channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical 

25 impulse conduction down the axon of a nerve cell, 2) transport of molecules into cells against 
concentration gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion. 
Ion Transporters 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the 
energy derived from ATP hydrolysis, they transport ions against the ion' s concentration gradient. 

30 These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion 
transporters, including Na*-K + ATPase, Ca 2+ -ATPase, and H*-ATPase, are activated by a 
phosphorylation event. P-class ion transporters, also known as E1-E2 type ATPases, are responsible 
for maintaining resting potential distributions such that cytosolic concentrations of Na* and Ca 2+ are 
low and cytosolic concentration of K + is high. The vacuolar (V) class of ion transporters includes 

35 H + pumps on intracellular organelles, such as lysosomes and Golgi. V-class ion transporters are 
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responsible for generating the low pH within the lumen of these organelles that is required for 
function. The coupling factor (F) class consists of H + pumps in the mitochondria. F-class ion 
transporters utilize a proton gradient to generate ATP from ADP and inorganic phosphate (Pj). 

The P-ATPases are hexamers of a 100 kD subunit with ten transmembrane domains and 
5 several large cytoplasmic regions that may play a role in ion binding (Scarborough, G.A. (1999) 
Curr. Opin. Cell Biol. 11:517-522). The V-ATPases are composed of two functional domains: the 
Vj domain, a peripheral complex responsible for ATP hydrolysis; and the V 0 domain, an integral 
complex responsible for proton translocation across the membrane. The F-ATPases are structurally 
and evolutionarily related to the V-ATPases. The F-ATPase F 0 domain contains 12 copies of the c 

10 subunit, a highly hydrophobic protein composed of two transmembrane domains and containing a 
single buried carboxyl group in TM2 that is essential for proton transport The V-ATPase V 0 
domain contains three types of homologous c subunits with four or five transmembrane domains 
and the essential carboxyl group in TM4 or TM3. Both types of complex also contain a single a 
subunit that may be involved in regulating the pH dependence of activity (Forgac, M. (1999) J. 

15 Biol. Chem. 274:12951-12954). 

The resting potential of the cell is utilized in many processes involving carrier proteins and 
gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out 
of the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport 
(symport) so that the movement of Na + down an electrochemical gradient drives transport of the 

20 other molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca 2+ out of 
the cell with transport of Na + into the cell (antiport). 
Gated Ion Channels 

Gated ion channels control ion flow by regulating the opening and closing of pores. The 
ability to control ion flux through various gating mechanisms allows ion channels to mediate such 

25 diverse signaling and homeostatic functions as neuronal and endocrine signaling, muscle 

contraction, fertilization, and regulation of ion and pH balance. Gated ion channels are categorized 
according to the manner of regulating the gating function. Mechanically-gated channels open their 
pores in response to mechanical stress; voltage-gated channels (e.g., Na + , K + , Ca 2+ , and Cl'channels) 
open their pores in response to changes in membrane potential; and ligand-gated channels (e.g., 

30 acetylcholine-, serotonin-, and glutamate-gated cation channels, and GAB A- and glycine-gated 
chloride channels) open their pores in the presence of a specific ion, nucleotide, or neurotransmitter. 
The gating properties of a particular ion channel (i.e., its threshold for and duration of opening and 
closing) are sometimes modulated by association with auxiliary channel proteins and/or post 
translational modifications, such as phosphorylation. 

35 Mechanically-gated or mechanosensitive ion channels act as transducers for the senses of 
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touch, hearing, and balance, and also play important roles in cell volume regulation, smooth muscle 
contraction, and cardiac rhythm generation. A stretch-inactivated channel (SIC) was recently 
cloned from rat kidney. The SIC channel belongs to a group of channels which are activated by 
pressure or stress on the cell membrane and conduct both Ca 2+ and Na + (Suzuki, M. et al. (1999) J. 
5 Biol. Chem. 274:6330-6335). 

The pore-forming subunits of the voltage-gated cation channels form a superfamily of ion 
channel proteins. The characteristic domain of these channel proteins comprises six transmembrane 
domains (S1-S6), a pore-forming region (P) located between S5 and S6, and intracellular amino and 
carboxy termini. In the Na + and Ca 2+ subfamilies, this domain is repeated four times, while in the 

10 K+ channel subfamily, each channel is formed from a tetramer of either identical or dissimilar 
subunits. The P region contains information specifying the ion selectivity for the channel. In the 
case of K+ channels, a GYG tripeptide is involved in this selectivity (Ishii, T.M. et al. (1997) Proc: ' 
Natl. Acad. Sci. USA 94:11651-11656). 

Voltage-gated Na + and K + channels are necessary for the function of electrically excitable 

15 cells, such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and 
muscle contraction, arise from large, transient changes in the permeability of the membrane to Na + 
and K + ions. Depolarization of the membrane beyond the threshold level opens voltage-gated Na + 
channels. Sodium ions flow into the cell, further depolarizing the membrane and opening more 
voltage-gated Na + channels, which propagates the depolarization down the length of the cell. 

20 Depolarization also opens voltage-gated potassium channels. Consequently, potassium ions flow 
outward, which leads to repolarization of the membrane. Voltage-gated channels utilize charged 
residues in the fourth transmembrane segment (S4) to sense voltage change. The open state lasts 
only about 1 millisecond, at which time the channel spontaneously converts into an inactive state 
that cannot be opened irrespective of the membrane potential. Inactivation is mediated by the 

25 channel's N-terminus, which acts as a plug that closes the pore. The transition from an inactive to a 
closed state requires a return to resting potential. 

Voltage-gated Na + channels are heterotrimeric complexes composed of a 260 kDa pore- 
forming a subunit that associates with two smaller auxiliary subunits, pi and f*2. The p2 subunit is 
a integral membrane glycoprotein that contains an extracellular Ig domain, and its association with a 

30 and pi subunits correlates with increased functional expression of the channel, a change in its 
gating properties, as well as an increase in whole cell capacitance due to an increase in membrane 
surface area (Isom, L.L. et al. (1995) Cell 83:433-442). 

Non voltage-gated Na* channels include the members of the amiloride-sensitive Na* 
channel/degenerin (NaC/DEG) family. Channel subunits of this family are thought to consist of two 

35 transmembrane domains flanking a long extracellular loop, with the amino and carboxyl termini 
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located within the cell. The NaC/DEG family includes the epithelial Na + channel (ENaC) involved 
in Na + reabsorption in epithelia including the airway, distal colon, cortical collecting duct of the 
kidney, and exocrine duct glands. Mutations in ENaC result in pseudohypoaldosteronism type 1 
and Liddle's syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the 
5 recently characterized KF-gated cation channels or acid-sensing ion channels (ASIC). ASIC 
subunits are expressed in the brain and form heteromultimeric Na + -permeable channels. These 
channels require acid pH fluctuations for activation. ASIC subunits show homology to the 
degenerins, a family of mechanically-gated channels originally isolated from C elegans. Mutations 
in the degenerins cause neurodegeneration. ASIC subunits may also have a role in neuronal 

10 function, or in pain perception, since tissue acidosis causes pain (Waldmann, R. and M. Lazdunski 
(1998) Out. Opin. Neurobiol. S:418-424; Eglen, R.M. et al. (1999) Trends Pharmacol. Sci. 20:337- 
342). • • • • . .. 

K + channels are located in all cell types, and may be regulated by voltage, ATP 
concentration, or second messengers such as Ca 2+ and cAMP. In non-excitable tissue, K* channels 

15 are involved in protein synthesis, control of endocrine secretions, and the maintenance of osmotic 
equilibrium across membranes. In neurons and other excitable cells, in addition to regulating action 
potentials and repolarizing membranes, K + channels are responsible for setting the resting 
membrane potential. The cytosol contains non-diffusible anions and, to balance this net negative 
charge, the cell contains a Na + -K* pump and ion channels that provide the redistribution of Na + , K + , 

20 and CI*. The pump actively transports Na + out of the cell and K + into the cell in a 3:2 ratio. Ion 
channels in the plasma membrane allow K + and CI " to flow by passive diffusion. Because of the 
high negative charge within the cytosol, CI * flows out of the cell. The flow of K + is balanced by an 
electromotive force pulling K* into the cell, and a K + concentration gradient pushing K+ out of the 
cell. Thus, the resting membrane potential is primarily regulated by K + flow (Salkoff, L. and T. 

25 Jegla (1995) Neuron 15:489-492). 

Potassium channel subunits of the Shaker -like superfamily all have the characteristic six 
transmembrane/1 pore domain structure. Four subunits combine as homo- or heterotetramers to 
form functional K channels. These pore-forming subunits also associate with various cytoplasmic P 
subunits that alter channel inactivation kinetics. The Shaker -like channel family includes the 

30 voltage-gated K + channels as well as the delayed rectifier type channels such as the human ether-a- 
go-go related gene (HERG) associated with long QT, a cardiac dysiythmia syndrome (Curran, M.E. 
(1998) Curr. Opin. Biotechnol. 9:565-572; Kaczorowski, G.J. and M.L. Garcia (1999) Curr. Opin. 
Chem. Biol. 3:448-458). 

A second superfamily of K + channels is composed of the inward rectifying channels (Kir). 

35 Kir channels have the property of preferentially conducting K + currents in the inward direction. 
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These proteins consist of a single potassium selective pore domain and two transmembrane 
domains, which correspond to the fifth and sixth transmembrane domains of voltage-gated K* 
channels. Kir subunits also associate as tetramers. The Kir family includes ROMK1, mutations in 
which lead to Bartter syndrome, a renal tubular disorder. Kir channels are also involved in 
5 regulation of cardiac pacemaker activity, seizures and epilepsy, and insulin regulation (Doupnik, 
C.A. et al. (1995) Curr. Opin. Neurobiol. 5:268-277; Curran, supra) . 

The recently recognized TWTK K* channel family includes the mammalian TWIK-1, 
TREK-1 and TASK proteins. Members of this family possess an overall structure with four 
transmembrane domains and two P domains. These proteins are probably involved in controlling 

10 the resting potential in a large set of cell types (Duprat, F. et al. (1997) EMBO J 16:5464-5471). 

The voltage-gated Ca 2 * channels have been classified into several subtypes based upon their 
electrophysiological and pharmacological characteristics. L-type Ca 2+ channels are predominantly 
expressed in heart and skeletal muscle where they play an essential role in excitation-contraction 
coupling. T-type channels are important for cardiac pacemaker activity, while N-type and P/Q-type 

15 channels are involved in the control of neurotransmitter release in the central and peripheral nervous 
system. The L-type and N-type voltage-gated Ca 2+ channels have been purified and, though their 
functions differ dramatically, they have similar subunit compositions. The channels are composed 
of three subunits. The a! subunit forms the membrane pore and voltage sensor, while the 0^5 and p 
subunits modulate the voltage-dependence, gating properties, and the current amplitude of the 

20 channel. These subunits are encoded by at least six a,, one c^S, and four p genes. A fourth subunit, 
Y» has been identified in skeletal muscle (Walker, D. et al. (1998) J. Biol. Chem. 273:2361-2367; 
McCleskey, E.W. (1994) Curr. Opin. Neurobiol. 4:304-312). 

The high-voltage-activated Ca 2+ channels that have been characterized biochemically 
include complexes of a pore-forming alpha 1 subunit of approximately 190-250 kDa; a 

25 transmembrane complex of alpha2 and delta subunits; an intracellular beta subunit; and in some 
cases a transmembrane gamma subunit. A variety of alpha 1 subunits, alpha2delta complexes, beta 
subunits, and gamma subunits are known. The Cavl family of alphal subunits conduct L-type Ca 2+ 
currents, which initiate muscle contraction, endocrine secretion, and gene transcription, and are 
regulated primarily by second messenger-activated protein phosphorylation pathways. The Cav2 

30 family of alphal subunits conduct N-type, P/Q-type, and R-type Ca 2+ currents, which initiate rapid 
synaptic transmission and are regulated primarily by direct interaction with G proteins and SNAKE 
proteins and secondarily by protein phosphorylation. The Cav3 family of alphal subunits conduct 
T-type Ca 2+ currents, which are activated and inactivated more rapidly and at more negative 
membrane potentials than other Ca 2 + current types. The distinct structures and patterns of 

35 regulation of these three families of Ca 2 * channels provide an array of Ca 2+ entry pathways in 
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response to changes in membrane potential and a range of possibilities for regulation of Ca 2+ entry 
by second messenger pathways and interacting proteins (Catterall, W.A. (2000) Annu. Rev. Cell 
Dev. Biol. 16:521-555). 

The alpha-2 subunit of the voltage-gated Ca 2 *-channel may include one or more Cache 
5 domains. An extracellular Cache domain may be fused to an intracellular catalytic domain, such as 
the histidine kinase, PP2C phosphatase, GGDEF (a predicted diguanylate cyclase), HD-GYP (a 
predicted phosphodiesterase) or adenylyl cyclase domain, or to a noncatalytic domain, like the 
methyl-accepting, DNA-binding winged helix-tum-helix, GAF, PAS or HAMP (a domain found in 
istidine kinases, denylyl cyclases, ethyl-binding proteins and phosphatases). Small molecules are 

10 bound via the Cache domain and this signal is converted into diverse outputs depending on the 

intracellular domains ( Aiuntruraman, V. and Aravind, L.(2000) Trends Biochem. Sci. 25:535-537). 

. The transient receptor family (Tip) of calcium ion channels are thought to mediate * 
capacitative calcium entry (CCE). CCE is the Ca 2+ influx into cells to resupply Ca 2+ stores depleted 
by the action of inositol triphosphate (IP3) and other agents in response to numerous hormones and 

15 growth factors. Trp and Trp-likc were first cloned from Drosophila and have similarity to voltage 
gated Ca 2+ channels in the S3 through S6 regions. This suggests that Trp and/or related proteins 
may form mammalian CCE channels (Zhu, X. et al. (1996) Cell 85:661-671; Boulay, G. et al. (1997) 
J. Biol. Chem. 272:2%72-296S0). Melastatin is a gene isolated in both the mouse and human, 
whose expression in melanoma cells is inversely correlated with melanoma aggressiveness in vivo . 

20 The human cDNA transcript corresponds to a 1533-amino acid protein having homology to 
members of the Trp family. It has been proposed that the combined use of malastatin mRNA 
expression status and tumor thickness might allow for the deterrnination of subgroups of patients at 
both low and high risk for developing metastatic disease (Duncan, L.M. et al (2001) J. Clin. Oncol. 
19:568-576). 

25 Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and 

organelle pH. In secretory epithelial cells, CI ' enters the cell across a basolateral membrane through 
an Na + , K7C1 * cotransporter, accumulating in the cell above its electrochemical equilibrium 
concentration. Secretion of CI ' from the apical surface, in response to hormonal stimulation, leads 
to flow of Na* and water into the secretory lumen. The cystic fibrosis transmembrane conductance 

30 regulator (CFTR) is a chloride channel encoded by the gene for cystic fibrosis, a common fatal 
genetic disorder in humans. CFTR is a member of the ABC transporter family, and is composed of 
two domains each consisting of six transmembrane domains followed by a nucleotide-binding site. 
Loss of CFTR function decreases transepithelial water secretion and, as a result, the layers of mucus 
that coat the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear. 

35 The resulting blockage of these sites leads to pancreatic insufficiency, "meconium ileus", and 
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devastating "chronic obstructive pulmonary disease" (Al-Awqati, Q. et al. (1992) J. Exp. Biol. 
172:245-266). 

The voltage-gated chloride channels (CLC) are characterized by 10-12 transmembrane 
domains, as well as two small globular domains known as CBS domains. The CLC subunits 
5 probably function as homotetramers. CLC proteins are involved in regulation of cell volume, 
membrane potential stabilization, signal transduction, and transepithelial transport Mutations in 
CLC-1, expressed predominantly in skeletal muscle, are responsible for autosomal recessive 
generalized myotonia and autosomal dominant myotonia congenita, while mutations in the kidney 
channel CLC-5 lead to kidney stones (Jentsch, TJ. (1996) Curr. Opin. Neurobiol. 6:303-310). 

10 Ligand-gated channels open their pores when an extracellular or intracellular mediator 

binds to the channel. Neurotransmitter-gated channels are channels that open when a 
neurotransmitter binds to their extracellular domain. These channels exist in the postsynaptic 
membrane of nerve or muscle cells. There are two types of neurotransmitter-gated channels. 
Sodium channels open in response to excitatory neurotransmitters, such as acetylcholine, glutamate, 

15 and serotonin. This opening causes an influx of Na + and produces the initial localized 

depolarization that activates the voltage-gated channels and starts the action potential. Chloride 
channels open in response to inhibitory neurotransmitters, such as y-aminobutyric acid (GABA) and 
glycine, leading to hyperpolarization of the membrane and the subsequent generation of an action 
potential. Neurotransmitter-gated ion channels have four transmembrane domains and probably 

20 function as pentamers (Jentsch, supra) . Amino acids in the second transmembrane domain appear to 
be important in determining channel permeation and selectivity (Sather, W.A. et al. (1994) Curr. 
Opin. Neurobiol. 4:313-323). 

Ligand-gated channels can be regulated by intracellular second messengers. For example, 
calcium-activated K + channels are gated by internal calcium ions. In nerve cells, an influx of 

25 calcium during depolarization opens K + channels to modulate the magnitude of the action potential 
(Ishi et al., supra) . The large conductance (BK) channel has been purified from brain and its subunit 
composition determined. The a subunit of the BK channel has seven rather than six transmembrane 
domains in contrast to voltage-gated K + channels. The extra transmembrane domain is located at 
the subunit N-terminus. A 2S-amino-acid stretch in the C-terminal region of the subunit (the 

30 "calcium bowl" region) contains many negatively charged residues and is thought to be the region 
responsible for calcium binding. The P subunit consists of two transmembrane domains connected 
by a glycosylated extracellular loop, with intracellular N- and C-termini (Kaczorowski, supra : 
Vergara, C. et al. (1998) Curr. Opin. Neurobiol. 8:321-329). 

Cyclic nucleotide-gated (CNG) channels are gated by cytosolic cyclic nucleotides. The best 

35 examples of these are the cAMP-gated Na + channels involved in olfaction and the cGMP-gated 
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cation channels involved in vision. Both systems involve ligand-mediated activation of a G-protein 
coupled receptor which then alters the level of cyclic nucleotide within the cell CNG channels also 
represent a major pathway for Ca 2+ entry into neurons, and play roles in neuronal development and 
plasticity. CNG channels are tetramers containing at least two types of subunits, an a subunit which 
5 can form functional homomeric channels, and a P subunit, which modulates the channel properties. 
All CNG subunits have six transmembrane domains and a pore forming region between the fifth and 
sixth transmembrane domains, similar to voltage-gated K + channels. A large C-terminal domain 
contains a cyclic nucleotide binding domain, while the N-terminal domain confers variation among 
channel subtypes (Zufall, R et al. (1997) Curr. Opin. Neurobiol. 7:404-412). 

10 The activity of other types of ion channel proteins may also be modulated by a variety of 

intracellular signalling proteins. Many channels have sites for phosphorylation by one or more 
protein kinases including protein kinase A, protein kinase C, tyrosine kinase, and casein kinase n, 
all of which regulate ion channel activity in cells. Kir channels are activated by the binding of the 
Gffy subunits of heterotrimeric G-proteins (Reimann, F. and F.M. Ashcroft (1999) Curr. Opin. Cell. 

15 Biol. 1 1:503-508). Other proteins are involved in the localization of ion channels to specific sites in 
the cell membrane. Such proteins include the PDZ domain proteins known as MAGUKs 
(membrane-associated guanylate kinases) which regulate the clustering of ion channels at neuronal 
synapses (Craven, S.E. and D.S. Bredt (199S) Cell 93:495-498). 
Disease Correlation 

20 The etiology of numerous human diseases and disorders can be attributed to defects in the 

transport of molecules across membranes. Defects in the trafficking of membrane-bound 
transporters and ion channels are associated with several disorders, e.g., cystic fibrosis, 
glucose-galactose malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain 
forms of diabetes mellitiis. Single-gene defect diseases resulting in an inability to transport small 

25 molecules across membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, and Fanconi 
disease (van't Hoff, W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G M. et al. (1994) Ann. Intern. 
Med. 120:218-226; and Chillon, M et al: (1995) New Engl. J. Med. 332:1475-1480). 

Human diseases caused by mutations in ion channel genes include disorders of skeletal 
muscle, cardiac muscle, and the central nervous system. Mutations in the pore-forming subunits of 

30 sodium and chloride channels cause myotonia, a muscle disorder in which relaxation after voluntary 
contraction is delayed. Sodium channel myotonias have been treated with channel blockers. 
Mutations in muscle sodium and calcium channels cause forms of periodic paralysis, while 
mutations in the sarcoplasmic calcium release channel, T-tubule calcium channel, and muscle 
sodium channel cause malignant hyperthermia. Cardiac arrythmia disorders such as the long QT 

35 syndromes and idiopathic ventricular fibrillation are caused by mutations in potassium and sodium 
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channels (Cooper, E.C. and L.Y. Jan (1998) Proc. NatL Acad. Sci. USA 96:4759-4766). All four 
known human idiopathic epilepsy genes code for ion channel proteins (Berkovic, S.F. and LE. 
Scheffer (1999) Curr. Opin. Neurology 12:177-182). Other neurological disorders such as ataxias, 
hemiplegia migraine and hereditary deafness can also result from mutations in ion channel genes 
5 (Jen, J. (1999) Curr. Opin. Neurobiol. 9:274-280; Cooper, supra) . 

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels 
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia. 
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head 
trauma, and neurodegenerative disease (Taylor, CP. and L.S. Narasimhan (1997) Adv. Pharmacol. 

10 39:47-98). Various classes of ion channels also play an important role in the perception of pain, and 
thus are potential targets for new analgesics. These include the vanilloid-gated ion channels, which 
are activated by the vanilloid capsaicin, as well as by noxious heat. Local anesthetics such as ■ ' 
lidocaine and mexiletinc which blockade voltage-gated Na + channels have been useful in the 
treatment of neuropathic pain (Eglen, supra ). Calcium-channel protein expression is altered in 

15 metastatic melanomas (Enklaar. T. et al. (2000) Genomics 67:179-187. 

Ion channels in the immune system have recently been suggested as targets for 
immunomodulation. T<ell activation depends upon calcium signaling, and a diverse set of T-cell 
specific ion channels has been characterized that affect this signaling process. Channel blocking 
agents can inhibit secretion of lympbokines, cell proliferation, and killing of target cells. A peptide 

20 antagonist of the T-cell potassium channel Kvl.3 was found to suppress delayed-type 

hypersensitivity and allogenic responses in pigs, validating the idea of channel blockers as safe and 
efficacious immunosuppressants (Cahalan, M.D. and K.G. Chandy (1997) Curr. Opin. Biotechnol. 
8:749-756). 

The discovery of new transporters and ion channels, and the polynucleotides encoding 
25 them, satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of transport, neurological, muscle, immunological and cell proliferative 
disorders, and in the assessment of the effects of exogenous compounds on the expression of nucleic 
acid and amino acid sequences of transporters and ion channels. 

30 SUMMARY OF THE INVENTION 

The invention features purified polypeptides, transporters and ion channels, referred to 
collectively as 'TRICH" and individually as "TRICH-1," 'TRICH-2," 'TRICH-3," 'TRICH-4," 
"TRICH-5," 'TRICH-6," 'TRICHW 'TRICH-8," 'TRICH-9," 'TRICH-10," "TRICH-11," 
"TRICH-12," "TRICH-13," 'TRICH-14," 'TRICH-15," "TRICH-16," 'TRICH-17," "TRICH-18," 
35 'TRICH-19," 'TRICH-20," "TRICH-21," "TRICH-22," 'TRICH-23," 'TRICH-24," 'TRICH-25," 
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"TRICH-26," 'TRICH-27/' 'TRICH-28," "TRICH-29," "TRICH-30," "TRICH-31," and "TRICH- 
32." In one aspect, the invention provides an isolated polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 
5 sequence at least 90% identical to an amino acid sequence selected from the group consisting of 
SEQ ED NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
In one alternative, the invention provides an isolated polypeptide comprising the amino acid 

10 sequence of SEQ ID NO: 1-32. 

The invention further provides an isolated polynucleotide encoding a polypeptide selected 
from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the"' 
group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 
sequence at least 90% identical to an amino acid sequence selected from the group consisting of 

15 SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
In one alternative, the polynucleotide encodes a polypeptide selected from the group consisting of 
SEQ ID NO: 1-32. In another alternative, the polynucleotide is selected from the group consisting of 

20 SEQ ID NO:33-64. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 

25 sequence at least 90% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
In one alternative, the invention provides a cell transformed with the recombinant polynucleotide. 

30 In another alternative, the invention provides a transgenic organism comprising the recombinant 
polynucleotide. 

The invention also provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 
35 sequence at least 90% identical to an amino acid sequence selected from the group consisting of 
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SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 
The method comprises a) culturing a cell under conditions suitable for expression of the 
5 polypeptide, wherein said cell is transformed with a recombinant polynucleotide comprising a 
promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) recovering 
the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 

10 sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide ' 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 

15 consisting of SEQ ID NO: 1 -32. 

The invention further provides an isolated polynucleotide selected from the group 
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:33-64, b) a polynucleotide comprising a naturally occurring 
polynucleotide sequence at least 90% identical to a polynucleotide sequence selected from the 

20 group consisting of SEQ ID NO:33-64, c) a polynucleotide complementary to the polynucleotide of 
a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA equivalent of a)- 
d). In one alternative, the polynucleotide comprises at least 60 contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a 
sample, said target polynucleotide having a sequence of a polynucleotide selected from the group 

25 consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:33-64, b) a polynucleotide comprising a naturally occurring 
polynucleotide sequence at least 90% identical to a polynucleotide sequence selected from the 
group consisting of SEQ ID NO:33-64, c) a polynucleotide complementary to the polynucleotide of 
a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA equivalent of a)- 

30 d). The method comprises a) hybridizing the sample with a probe comprising at least 20 contiguous 
nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and 
which probe specifically hybridizes to said target polynucleotide, under conditions whereby a 
hybridization complex is formed between said probe and said target polynucleotide or fragments 
thereof, and b) detecting the presence or absence of said hybridization complex, and optionally, if 

35 present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 
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nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide selected from the group consisting 
of a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of 
5 SEQ ID NO.33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at 
least 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain 

10 reaction amplification, and b) detecting the presence or absence of said amplified target 
polynucleotide or fragment thereof, and, optionally, if present, the amount thereof. 

The invention further provides a composition comprising an effective amount of a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a 

15 naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, and a pharmaceutical^ acceptable excipient. In one embodiment, 

20 the composition comprises an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32. The invention additionally provides a method of treating a disease or condition associated 
with decreased expression of functional TRICH, comprising administering to a patient in need of 
such treatment the composition. 

The invention also provides a method for screening a compound for effectiveness as an 

25 agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ED NO: 1-32, b) a polypeptide comprising 
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 

30 immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. The method comprises a) exposing a sample comprising the 
polypeptide to a compound, and b) detecting agonist activity in the sample. In one alternative, the 
invention provides a composition comprising an agonist compound identified by the method and a 
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of 

35 treating a disease or condition associated with decreased expression of functional TRICH, 
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comprising administering to a patient in need of such treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness 
as an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide 
5 comprising a naturally occurring amino acid sequence at least 90% identical to an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment 
of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32, and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32. The method comprises a) exposing a sample 

10 comprising the polypeptide to a compound, and b) detecting antagonist activity in the sample. In 
one alternative, the invention provides a composition comprising an antagonist compound identified 
by the method and a pharmaceutically acceptable excipient. Li another alternative, the invention ' 
provides a method of treating a disease or condition associated with overexpression of functional 
TRICH, comprising administering to a patient in need of such treatment the composition. 

15 The invention further provides a method of screening for a compound that specifically 

binds to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising 
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 

20 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. The method comprises a) combining the polypeptide with at least 
one test compound under suitable conditions, and b) detecting binding of the polypeptide to the test 
compound, thereby identifying a compound that specifically binds to the polypeptide. 

25 The invention further provides a method of screening for a compound that modulates the 

activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising 
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide 

30 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. The method comprises a) combining the polypeptide with at least 
one test compound under conditions permissive for the activity of the polypeptide, b) assessing the 
activity of the polypeptide in the presence of the test compound, and c) comparing the activity of 

35 the polypeptide in the presence of the test compound with the activity of the polypeptide in the 
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absence of the test compound, wherein a change in the activity of the polypeptide in the presence of 
the test compound is indicative of a compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, the method 
comprising a) exposing a sample comprising the target polynucleotide to a compound, b) detecting 
altered expression of the target polynucleotide, and c) comparing the expression of the target 
polynucleotide in the presence of varying amounts of the compound and in the absence of the 
compound. 

The invention further provides ia method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20^ 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, ii) 
a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, iii) a 
polynucleotide having a sequence complementary to i), iv) a polynucleotide complementary to the 
polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions 
whereby a specific hybridization complex is formed between said probe and a target polynucleotide 
in the biological sample, said target polynucleotide selected from the group consisting of i) a 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33- 
64, iii) a polynucleotide complementary to the polynucleotide of i), iv) a polynucleotide 
complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Alternatively, the 
target polynucleotide comprises a fragment of a polynucleotide sequence selected from the group 
consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) comparing the 
amount of hybridization complex in the treated biological sample with the amount of hybridization 
complex in an untreated biological sample, wherein a difference in the amount of hybridization 
complex in the treated biological sample is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 
Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
sequences of the present invention. 

Table 2 shows the GenBank identification number and annotation of the nearest GenBank 
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homolog for polypeptides of the invention. The probability scores for the matches between each 
polypeptide and its homolog(s) are also shown. 

Table 3 shows structural features of polypeptide sequences of the invention, including 
predicted motifs and domains, along with the methods, algorithms, and searchable databases used 
5 for analysis of the polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
polynucleotide sequences of the invention, along with selected fragments of the polynucleotide 
sequences. 

Table 5 shows the representative cDNA library for polynucleotides of the invention. 
10 Table 6 provides an appendix which describes the tissues and vectors used for construction 

of the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and' 
polypeptides of the invention, along with applicable descriptions, references, and threshold 
parameters. 

15 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is 
understood that this invention is not limited to the particular machines, materials and methods 
described, as these may vary. It is also to be understood that the terminology used herein is for the 

20 puipose of describing particular embodiments only, and is not intended to limit the scope of the 
present invention which will be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," 
"an," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for 
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an 

25 antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled in 
the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although any machines, materials, and methods similar or equivalent to those described 

30 herein can be used to practice or test the present invention, the preferred machines, materials and 
methods are now described. All publications mentioned herein are cited for the purpose of 
describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the 
publications and which might be used in connection with the invention. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate such disclosure by virtue of 

35 prior invention. 
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DEFINITIONS 

•TRICH" refers to the amino acid sequences of substantially purified TRICH obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, 
and human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 
5 The term "agonist" refers to a molecule which intensifies or mimics the biological activity 

of TRICH. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any 
other compound or composition which modulates the activity of TRICH either by directly 
interacting with TRICH or by acting on components of the biological pathway in which TRICH " 
participates. 

10 An "allelic variant" is an alternative form of the gene encoding TRICH. Allelic variants 

may result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs 
or in polypeptides whose structure or function may or may not be altered. A gene may have none, ' 
one, or many allelic variants of its naturally occurring form. Common mutational changes which ' 
give rise to allelic variants are generally ascribed to natural deletions, additions, or substitutions of 

15 nucleotides. Each of these types of changes may occur alone, or in combination with the others, one 
or more times in a given sequence. 

"Altered" nucleic acid sequences encoding TRICH include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TRICH or 
a polypeptide with at least one functional characteristic of TRICH. Included within this definition 

20 are polymorphisms which may or may not be readily detectable using a particular oligonucleotide 
probe of the polynucleotide encoding TRICH, and improper or unexpected hybridization to allelic 
variants, with a locus other than the normal chromosomal locus for the polynucleotide sequence 
encoding TRICH. The encoded protein may also be "altered," and may contain deletions, 
insertions, or substitutions of amino acid residues which produce a silent change and result in a 

25 functionally equivalent TRICH. Deliberate amino acid substitutions may be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues, as long as the biological or immunological activity of TRICH is retained. For 
example, negatively charged amino acids may include aspartic acid and glutamic acid, and 
positively charged amino acids may include lysine and arginine. Amino acids with uncharged polar 

30 side chains having similar hydrophilicity values may include: asparagine and glutamine; and serine 
and threonine. Amino acids with uncharged side chains having similar hydrophilicity values may 
include: leucine, isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or 

35 synthetic molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally 
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occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino 
acid sequence to the complete native amino acid sequence associated with the recited protein 
molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
5 Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological 
activity of TRICH. Antagonists may include proteins such as antibodies, nucleic acids, 
carbohydrates, small molecules, or any other compound or composition which modulates the 

10 activity of TRICH either by directly interacting with TRICH or by acting on components of the 
biological pathway in which TRICH participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments y 
thereof, such as Fab, F(ab') 2 , and Fv fragments, which are capable of binding an epitopic 
determinant. Antibodies that bind TRICH polypeptides can be prepared using intact polypeptides or 

15 using fragments containing small peptides of interest as the immunizing antigen. The polypeptide 
or oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from 
the translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if 
desired. Commonly used carriers that are chemically coupled to peptides include bovine serum 
albumin, thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used 

20 to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 

25 on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a 
specific molecular target. Aptamers are derived from an in vitro evolutionary process (e.g., SELEX 
(Systematic Evolution of Ligands by Exponential Enrichment), described in U.S. Patent No. 

30 5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries. 
Aptamer compositions may be double-stranded or single-stranded, and may include 
deoxyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. 
The nucleotide components of an aptamer may have modified sugar groups (e.g., the 2-OH group of 
a ribonucleotide may be replaced by 2-F or 2-NH 2 ), which may improve a desired property, e.g., 

35 resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 
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e.g., a high molecular weight carrier to slow clearance of the aptanier from the circulatory system. 
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a 
cross-linker. (See, e.g., Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13.) 

The term "intramer" refers to an aptamer which is expressed in vivo . For example, a 
vaccinia virus-based RNA expression system has been used to express specific RNA aptamers at 
high levels in the cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Natl Acad. Sci. USA 
96:3606-3610). 

The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left- 
handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 
nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on 
substrates containing nghi -handed nucleotides. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" ' 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; 
RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates. rnethylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2-rnethoxycthyl sugars or 2-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-mcthyI cylosine, 2-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or 
biochemical functions of a naturally occurring molecule. Likewise, "immunologically active" or 
"immunogenic" refers to the capability of the natural, recombinant, or synthetic TRICH, or of any 
oligopeptide thereof, to induce a specific immune response in appropriate animals or cells and to 
bind with specific antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5-AGT-3' pairs with its complement, 
3'-TCA-5\ 

A "composition comprising a given polynucleotide sequence" and a "composition 
comprising a given amino acid sequence" refer broadly to any composition containing the given 
polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an 
aqueous solution. Compositions comprising polynucleotide sequences encoding TRICH or 
fragments of TRICH may be employed as hybridization probes. The probes may be stored in freeze- 
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dried form and may be associated with a stabilizing agent such as a carbohydrate. In hybridizations, 
the probe may be deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., 
sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's solution, dry milk, salmon 
sperm DNA, etc.). 

5 - "Consensus sequence" refers to a nucleic acid sequence which has been subjected to 

repeated DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit 
(Applied Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which 
has been assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a 
computer program for fragment assembly, such as the GELVBEW fragment assembly system (GCG, 

10 Madison WI) or Phrap (University of Washington, Seattle WA). Some sequences have been both 
extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least " 
interfere with the properties of the original protein, i.e., the structure and especially the function of 
the protein is conserved and not significantly changed by such substitutions. The table below shows 

15 amino acids which may be substituted for an original amino acid in a protein and which are 
regarded as conservative amino acid substitutions. 





Ala 


Gly, Ser 




Arg 


His, Lys 


20 


Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 




Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 


25 


Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




lie 


Leu, Val 




Leu 


He, Val 




Lys 


Arg, Gin, Glu 


30 


Met 


Leu, He 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 




Tip 


Phe, Tyr 


35 


Tyr 


His, Phe, Trp 




Val 


Be, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the 
polypeptide backbone in the area of the substitution, for example, as a beta sheet or alpha helical 
40 conformation, (b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or 
(c) the bulk of the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
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absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by 
an alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which 
5 retains at least one biological or immunological function of the natural molecule. A derivative 
polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least 
one biological or immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

10 "Differential expression" refers to increased or upregulated; or decreased, downregulated, 

. or absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be earned out between, for example, a treated and an untreated sample, or a 
diseased and a normal sample. 

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 

15 exon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 

A "fragment" is a unique portion of TRICH or the polynucleotide encoding TRICH which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise 

20 up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For 
example, a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. 
A fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at 
least 5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or 
amino acid residues in length. Fragments may be preferentially selected from certain regions of a 

25 molecule. For example, a polypeptide fragment may comprise a certain length of contiguous amino 
acids selected from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as 
shown in a certain defined sequence. Clearly these lengths are exemplary, and any length that is 
supported by the specification, including the Sequence Listing, tables, and figures, may be 
encompassed by the present embodiments. 

30 A fragment of SEQ ID NO:33-64 comprises a region of unique polynucleotide sequence 

that specifically identifies SEQ ID NO:33-64, for example, as distinct from any other sequence in 
the genome from which the fragment was obtained. A fragment of SEQ ID NO:33-64 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:33-64 from related polynucleotide sequences. The precise length of a fragment of SEQ 

35 ID NO:33-64 and the region of SEQ ID NO:33-64 to which the fragment corresponds are routinely 
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determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO: 1-32 is encoded by a fragment of SEQ ID NO:33-64. A 
fragment of SEQ ID NO: 1-32 comprises a region of unique amino acid sequence that specifically 
identifies SEQ ID NO: 1-32. For example, a fragment of SEQ ID NO: 1-32 is useful as an 
5 immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO:l- 
32. The precise length of a fragment of SEQ ID NO:l-32 and the region of SEQ ID NO: 1-32 to 
which the fragment corresponds are routinely determinable by one of ordinary skill in the art based 
on the intended purpose for the fragment. 

A "full length" polynucleotide sequence is one containing at least a translation initiation 
10 codon (e.g., methionine) followed by an open reading frame and a translation termination codon. A 
"full length" polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between' 
two or more polynucleocide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
15 to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 
in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
20 parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e 

sequence alignment program This program is part of the LASERGENE software package, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 
Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) 
CABIOS 8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters 
25 are set as follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" 
residue weight table is selected as the default. Percent identity is reported by CLUSTAL V as the 
"percent similarity" between aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison 
algorithms is provided by the National Center for Biotechnology Information (NCBI) Basic Local 
30 Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is 
available from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence 
analysis programs including "blastn," that is used to align a known polynucleotide sequence with 
other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 
2 Sequences" that is used for direct pairwise comparison of two nucleotide sequences. <C BLAST 2 
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Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. 
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool 
5 Version 2.0. 12 (April-2 1-2000) set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 
Reward for match: 1 
Penalty for mismatch: -2 
10 Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 
Expect: 10 
Word Size: 11 
- Filter: on 

1 5 Percent identity may be measured over the length of an entire defined sequence, for 

example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined sequence, for instance, a 
fragment of at least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 
contiguous nucleotides.. Such lengths are exemplary only, and it is understood that any fragment 

20 length supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

: Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that 
changes in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic 

25 acid sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer 
to the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 

30 substitutions, explained in more detail above, generally preserve the charge andjiydrophobicity at 
the site of substitution, thus preserving the structure/and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignments of 

35 polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 

27 



5DOCID: <WO 02464 15A2JA> 



WO 02/46415 



PCT/US01 746963 



penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table.. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0. 12 (April-21-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, 
for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at 
least 150 contiguous residues. Such lengths are exemplary only, and it is understood that any 
fragment length supported by the sequences shown herein, in the tables, figures or Sequence 
Listing, may be used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. ■ " 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of 
complementarity. Specific hybridization complexes form under permissive annealing conditions 
and remain hybridized after the "washing" step(s). The washing step(s) is particularly important in 
determining the stringency of the hybridization process, with more stringent conditions allowing 
less non-specific binding, i.e., binding between pairs of nucleic acid strands that are not perfectly 
matched. Permissive conditions for annealing of nucleic acid sequences are routinely determinable 
by one of ordinary skill in the art and may be consistent among hybridization experiments, whereas 
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wash conditions may be varied among experiments to achieve the desired stringency, and therefore 
hybridization specificity. Permissive annealing conditions occur, for example, at 68°C in the 
presence of about 6 x SSC, about 1% (w/v) SDS, and about 100 /zg/ml sheared, denatured salmon 
sperm DNA, 

5 Generally, stringency of hybridization is expressed, in part, with reference to the 

temperature under which the wash step is carried out. Such wash temperatures are typically 
selected to be about 5°C to 20°C lower than the thermal melting point (TJ for the specific sequence 
at a defined ionic strength and pH. The T m is the temperature (under defined ionic strength and pH) 
at which 50% of the target sequence hybridizes to a perfectly matched probe. An equation for 

10 calculating T m and conditions for nucleic acid hybridization are well known and can be found in 
Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold Spring 
Harbor Press, Plainview NY; specifically see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, 

15 fori hour. Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC 
concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. 
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents 
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 fig/ml. Organic 
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under 

20 particular circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash 
conditions will be readily apparent to those of ordinary skill in the art. Hybridization, particularly 
under high stringency conditions, may be suggestive of evolutionary similarity between the 
nucleotides. Such similarity is strongly indicative of a similar role for the nucleotides and their 
encoded polypeptides. 

25 The term "hybridization complex" refers to a complex formed between two nucleic acid 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C 0 t or R 0 t analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate 

30 substrate to which cells or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 

35 of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
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cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TRICH which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide 
5 fragment of TRICH which is useful in any of the antibody production methods disclosed herein or 
known in the art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
10 chemical compound having a unique and defined position on a microairay. 

The term "modulate" refers to a change in the activity of TRICH. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of TRICH. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, 
15 oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA 
of genomic or synthetic origin which may be single-stranded or double-stranded and may represent 
the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like 
material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
20 functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
25 comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone 
of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an TRICH may involve lipidation, glycosylation, 
30 phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in 
the art. These processes may occur synthetically or biochemically. Biochemical modifications will 
vary by cell type depending on the enzymatic milieu of TRICH. 

"Probe" refers to nucleic acid sequences encoding TRICH, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
35 isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
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Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 
"Primers'* are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
5 identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 
contiguous nucleotides of a known sequence. In order to enhance specificity, longer probes and 
primers may also be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 
60, 70, 80, 90, 100, or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. 
10 Probes and primers may be considerably longer than these examples, and it is understood that any 
length supported by the specification, including the tables, figures, and Sequence Listing, may be 
used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold 

15 Spring Harbor Press, Plainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular 
Biology . Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR 
Protocols. A Guide to Methods and Applications . Academic Press, San Diego CA. PCR primer 
pairs can be derived from a known sequence, for example, by using computer programs intended for 
that purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, 

20 Cambridge MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 

25 selection programs have incorporated additional features for expanded capabilities. For example, 
the PrimOU primer selection program (available to the public from the Genome Center at University 
of Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from 
megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 
primer selection program (available to the public from the Whitehead Institute/MIT Center for 

30 Genome Research, Cambridge MA) allows the user to input a "mispriming library," in which 
sequences t6 avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for 
the selection of oligonucleotides for microarrays. (The source code for the latter two primer 
selection programs may also be obtained from their respective sources and modified to meet the 
user's specific needs.) The PrimeGen program (available to the public from the UK Human 

35 Genome Mapping Project Resource Centre, Cambridge UK) designs primers based on multiple 
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sequence alignments, thereby allowing selection of primers that hybridize to either the most 
conserved or least conserved regions of aligned nucleic acid sequences. Hence, this program is 
useful for identification of both unique and conserved oligonucleotides and polynucleotide 
fragments. The oligonucleotides and polynucleotide fragments identified by any of the above 
selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited 
to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, 
a recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5* and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of 
the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of 
ribose instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH, 
nucleic acids encoding TRICH, or fragments thereof may comprise a bodily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
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cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular 

5 structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled 
A and the antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

10 removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with 
which they are naturally associated. •? 

: A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
by different amino acid residues or nucleotides, respectively. 

15 "Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 

chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" or "expression profile" refers to the collective pattern of gene 

20 expression by a particular cell type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a 
recipient cell. Transformation may occur under natural or artificial conditions according to various 
methods well known in the art, and may rely on any known method for the insertion of foreign 
nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for transformation is 

25 selected based on the type of host cell being transformed and may include, but is not limited to, 
bacteriophage or viral infection, electroporation, heat shock, lipofection, and particle bombardment 
The term "transformed cells" includes stably transformed cells in which the inserted DNA is 
capable of replication either as an autonomously replicating plasmid or as part of the host 
chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for 

30 limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a 

35 precursor of the cell, by way of deliberate genetic manipulation, such as by microinjection or by 
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infection with a recombinant virus. The term genetic manipulation does not include classical 
cross-breeding, or in vitro fertilization, but rather is directed to the introduction of a recombinant 
DNA molecule. The transgenic organisms contemplated in accordance with die present invention 
include bacteria, cyanobacteria, fungi, plants and animals. The isolated DNA of the present 
5 invention can be introduced into the host by methods known in the art, for example infection, 
transfection, transformation or transconjugation. Techniques for transferring the DNA of the 
present invention into such organisms are widely known and provided in references such as 
Sambrook et al. (1989), supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence ~ 

10 having at least 40% sequence identity to the particular nucleic acid sequence over a certain length of 
one of the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) set at default parameters. Such a pair of nucleic acids may show, for example, at " 
least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, or at least 98%, or greater 

15 sequence identity over a certain defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "sphce," "species," or "polymorphic" variant. A splice variant may 
have significant identity to a reference molecule, but will generally have a greater or lesser number 
of polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 

20 reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides will generally have significant amino acid identity relative to 
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
between individuals of a given species. Polymoiphic variants also may encompass "single 
nucleotide polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide 

25 base. The presence of SNPs may be indicative of, for example, a certain population, a disease state, 
or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence 
having at least 40% sequence identity to the particular polypeptide sequence over a certain length of 
one of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 

30 (May-07-1999) set at default parameters. Such a pair of polypeptides may show, for example, at 
least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, or at least 98%, or greater sequence 
identity over a certain defined length of one of the polypeptides. 

35 THE INVENTION 
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The invention is based on the discovery of new human transporters and ion channels 
(TRICH), the polynucleotides encoding TRICH, and the use of these compositions for the diagnosis, 
treatment, or prevention of transport, neurological, muscle, immunological and cell proliferative 
disorders. 

5 Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 

sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to 
a single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is 
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an 
Incyte polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide 

10 sequence is denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ 
ID NO:) and an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as 
shown. ' 

Table 2 shows sequences with homology to the polypeptides of the invention as identified 
by BLAST analysis against the GcnBank protein (genpept) database. Columns 1 and 2 show the 

15 polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding 
Incyte polypeptide sequence number (Incyte Polypeptide ID) for polypeptides of the invention. 
Column 3 shows the GenBank identification number (GenBank ID NO:) of the nearest GenBank 
homolog. Column 4 shows the probability scores for the matches between each polypeptide and its 
homolog(s). Column 5 shows the annotation of the GenBank homolog(s) along with relevant 

20 citations where applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 
and 2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding 
Incyte polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. 
Column 3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential 

25 phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the 
MOTIFS program of the GCG sequence analysis software package (Genetics Computer Group, 
Madison WI). Column 6 shows amino acid residues comprising signature sequences, domains, and 
motifs. Column 7 shows analytical methods for protein structure/function analysis and in some 
cases, searchable databases to which the analytical methods were applied. 

30 Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and 

these properties establish that the claimed polypeptides are transporters and ion channels. For 
example, SEQ ID NO:l is 83% identical to human sodium-hydrogen exchanger 6 (GenBank ID 
g2944233) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The 
BLAST probability score is 5.1e-242, which indicates the probability of obtaining the observed 

35 polypeptide sequence alignment by chance. SEQ ID NO: 1 also contains a sodium/hydrogen 
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exchanger family domain as determined by searching for statistically significant matches in the 
hidden Markov model (HMM)-based PFAM database of conserved protein family domains. (See 
Table 3.) Data from BLIMPS, and other BLAST analyses provide further corroborative evidence 
that SEQ ID NO:l is a sodium-hydrogen exchange transporter. In another example, SEQ ID NO:7 
5 is 85% identical to Rattus norvegicus Na7,K + -ATPase alpha subunit (GenBarik ID g619915) as 
determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 
probability score is 0.0, which indicates the probability of obtaining the observed polypeptide 
sequence alignment by chance. SEQ ID NO:7 also contains an E1-E2 ATPase domain as 
determined by searching for statistically significant matches in the hidden Markov model (HMM)- 

10 based PFAM database of conserved protein family domains. (See Table 3.) Data from BLIMPS, 
MOTIFS, and PROFELESCAN analyses provide further corroborative evidence that SEQ ID NO:7 
is a cation-transporting ATPase. In yet another example, SEQ ID NO: 13 is 77% identical to a - ' 
human carrier-like protein (GenBank ID g3694661) as determined by the Basic Local Alignment 
Search Tool (BLAST). (See Table 2.) The BLAST probability score is 5.5e-209, which indicates 

15 the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID 

NO: 13 also contains a mitochondrial energy transfer protein domain as determined by searching for 
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of 
conserved protein family domains. (See Table 3.) The presence of this domain is confirmed by 
BLIMPS, MOTIFS, and PROFILES CAN analyses, providing further corroborative evidence that 

20 SEQ ID NO: 13 is a transporter. Further, SEQ ID NO: 16 is 41% identical to human novel ATPase 
(GenBank ID g897980l) as determined by the Basic Local Alignment Search Tool (BLAST). (See 
Table 2.) The BLAST probability score is l.le-165, which indicates the probability of obtaining the 
observed polypeptide sequence alignment by chance. SEQ ID NO: 16 also contains an E1-E2 
ATPase domain as determined by searching for statistically significant matches in the hidden 

25 Markov model (HMM)-based PFAM database of conserved protein family domains. (See Table 3.) 
Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence 
that SEQ ID NO: 16 is a cation-transporting ATPase. In a further example, SEQ ID NO: 19 is 43% 
identical to Sinorhizobium sp. As4 ArsA, the catalytic subunit of the arsenic oxyanion-translocating 
ATPase (GenBank ID g5802945) as determined by the Basic Local Alignment Search Tool 

30 (BLAST). (See Table 2.) The BLAST probability score is 7.7e-125, which indicates the probability 
of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO: 19 also contains 
an anion-transporting ATPase domain as determined by searching for statistically significant 
matches in the hidden Markov model (HMM)-based PFAM database of conserved protein family 
domains. (See Table 3.) Data from additional BLAST and PROFILESCAN analyses provide 

35 further corroborative evidence that SEQ ID NO: 19 is an anion-transporting ATPase. In yet a further 
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example, SEQ ID NO:21 is 54% identical to a murine putative E1-E2 ATPase (GenBank ID 
g28577) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The 
BLAST probability score is 5.2e-190, which indicates the probability of obtaining the observed 
polypeptide sequence alignment by chance. SEQ ID NO:21 also contains six transmembrane 
5 domains as determined using TMAP, a program which delineates transmembrane segments. (See 
Table 3.) Data from BLIMPS, and MOTIFS, analyses provide further corroborative evidence that 
SEQ ID NO:21 is an ATPase. In a futher example, SEQ ID NO:24 is 52% identical, from residue 
A77 to residue L1007, to rat NMDAR-L (GenBank ID g2160125) as determined by the Basic Local 
Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 1.5e-262, which 

10 indicates the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ 
ID NO: 24 also contains a ligand gated ion channel domain as determined by searching for 
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of "* 
conserved protein family domains. (See Table 3.) Data from BLIMPS analysis provide further 
corroborative evidence that SEQ ID NO:24 is a glutamate receptor. SEQ ID NO:2-6, SEQ ID NO:8- 

15 12, SEQ ID NO:14-15, SEQ ID NO: 17-18, SEQ ID NO:20, SEQ ID NO:22-23, and SEQ ID NO:25- 
32 were analyzed and annotated in a similar manner. The algorithms and parameters for the 
analysis of SEQ ID NO: 1-32 are described in Table 7. 

As shown in Table 4, the full length polynucleotide sequences of the present invention were 
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 

20 combination of these two types of sequences. Column 1 lists the polynucleotide sequence 
identification number (Polynucleotide SEQ ID NO:), the corresponding Incyte polynucleotide 
consensus sequence number (Incyte ID) for each polynucleotide of the invention, and the length of 
each polynucleotide sequence in basepairs. Column 2 shows the nucleotide start (5') and stop (3') 
positions of the cDNA and/or genomic sequences used to assemble the full length polynucleotide 

25 sequences of the invention, and of fragments of the polynucleotide sequences which are useful, for 
example, in hybridization or amplification technologies that identify SEQ ID NO:33-64 or that 
distinguish between SEQ ID NO:33-64 and related polynucleotide sequences. 

The polynucleotide fragments described in Column 2 of Table 4 may refer specifically, for 
example, to Incyte cDNAs derived from tissue-specific cDNA libraries or from pooled cDNA 

30 libraries. Alternatively, the polynucleotide fragments described in column 2 may refer to GenBank 
cDNAs or ESTs which contributed to the assembly of the full length polynucleotide sequences. In 
addition, the polynucleotide fragments described in column 2 may identify sequences derived from 
the ENSEMBL (The Sanger Centre, Cambridge, UK) database \ie. 9 those sequences including the 
designation "ENST"). Alternatively, the polynucleotide fragments described in column 2 may be 

35 derived from the NCBI RefSeq Nucleotide Sequence Records Database (i.e., those sequences 
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including the designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (£«., 
those sequences including the designation "NP"). Alternatively, the polynucleotide fragments 
described in column 2 may refer to assemblages of both cDNA and Genscan-predicted exons 
brought together by an "exon stitching" algorithm. For example, a polynucleotide sequence 
5 identified as FLJCXXXXXJl^N^YYYYY^N, represents a "stitched" sequence in which 
XXXXXX is the identification number of the cluster of sequences to which the algorithm was 
applied, and YYYYY is the number of the prediction generated by the algorithm, and Nj 2 3 , if 
present, represent specific exons that may have been manually edited during analysis (See Example 
V). Alternatively, the polynucleotide fragments in column 2 may refer to assemblages of exons 

10 brought together by an "exon-stretching" algorithm. For example, a polynucleotide sequence 
identified as Fl^XXXX_gAAAAA_gBBBBB_l_N is a "stretched" sequence, with XXXXXX being 
the Incyte project identification number, gAAAAA being the GenBank identification number of the ' 
human genomic sequence to which the "exon-stretching" algorithm was applied, gBBBBB being the 
GenBank identification number or NCBI RefSeq identification number of the nearest GenBank 

15 protein homolog. and /V referring lo specific exons (See Example V). In instances where a RefSeq 
sequence was used as a protein homolog for the "exon-stretching" algorithm, a RefSeq identifier 
(denoted by "MM,- "NP.- or * 4 NT~) may be used in place of the GenBank identifier {i.e., gBBBBB). 

Alternatively, a prefix identifies component sequences that were hand-edited, predicted 
from genomic DNA sequences, or derived from a combination of sequence analysis methods. The 

20 following Table lists examples of component sequence prefixes and corresponding sequence 
analysis methods associated with the prefixes (see Example IV and Example V). 



25 



Prefix 


Type of analysis and/or examples of programs 


GNN, GFG. 
ENST 


Exon prediction from genomic sequences using, for example, 
GENSCAN (Stanford University, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK). 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Example V). 


INCY 


Full length transcript and exon prediction from mapping of EST 
sequences to the genome. Genomic location and EST composition 
data are combined to predict the exons and resulting transcript. 



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in 
30 Table 4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant 
Incyte cDNA identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotide 
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sequences which were assembled using Incyte cDNA sequences. The representative cDNA library 
is the Incyte cDNA library which is most frequently represented by the Incyte cDNA sequences 
which were used to assemble and confirm the above polynucleotide sequences. The tissues and 
vectors which were used to construct the cDNA libraries shown in Table 5 are described in Table 6. 

5 The invention also encompasses TRICH variants. A preferred TRICH variant is one which 

has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the TRICH amino acid sequence, and which contains at least one functional or 
structural characteristic of TRICH. 

The invention also encompasses polynucleotides which encode TRICH. In a particular 

10 embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:33-64, which encodes TRICH. The polynucleotide 
sequences of SEQ ID NO:33-64, as presented in the Sequence Listing, embrace the equivalent RNA 
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

15 The invention also encompasses a variant of a polynucleotide sequence encoding TRICH. 

In particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at 
least about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding TRICH. A particular aspect of the invention encompasses a variant of a 
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID 

20 NO:33-64 which has at least about 70%, or alternatively at least about 85%, or even at least about 
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting 
of SEQ ID NO:33-64. Any one of the polynucleotide variants described above can encode an amino 
acid sequence which contains at least one functional or structural characteristic of TRICH. . 

In addition, or in the alternative, a polynucleotide variant of the invention is a splice variant 

25 of a polynucleotide sequence encoding TRICH. A splice variant may have portions which have 
significant sequence identity to the polynucleotide sequence encoding TRICH, but will generally 
have a greater or lesser number of polynucleotides due to additions or deletions of blocks of 
sequence arising from alternate splicing of exons during mRNA processing. A splice variant may 
have less than about 70%, or alternatively less than about 60%, or alternatively less than about 50% 

30 polynucleotide sequence identity to the polynucleotide sequence encoding TRICH over its entire 
length; however, portions of the splice variant will have at least about 70%, or alternatively at least 
about 85%, or alternatively at least about 95%, or alternatively 100% polynucleotide sequence 
identity to portions of the polynucleotide sequence encoding TRICH. Any one of the splice variants 
described above can encode an amino acid sequence which contains at least one functional or 

35 structural characteristic of TRICH. 
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It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding TRICH, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be - 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
5 sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring TRICH, and all such variations are to be considered 
as being specifically disclosed. 

Although nucleotide sequences which encode TRICH and its variants are generally capable 
10 of hybridizing to the nucleotide sequence of the naturally occurring TRICH under appropriately 
selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding 
TRICH or its derivatives possessing a substantially different codon usage, e.g., inclusion of non- " 
naturally occurring codons. Codons may be selected to increase the rate at which expression of the 
peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with 
15 which particular codons are utilized by the host. Other reasons for substantially altering the 

nucleotide sequence encoding TRICH and its derivatives without altering the encoded amino acid 
sequences include the production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode TRICH and 
20 TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding TRICH or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:33-64 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399^07; Kimmel, A.R. (1987) Methods Enzymol. 
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in 
'Ttefinitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proofreading exonucleases such as those found in the 
ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence 

40 



V 



WO 02/46415 



PCT/USOJ/46963 



preparation is automated with machines such as the MICROLAB 2200 liquid transfer system 
(Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI 
CATALYST 800 thermal cycler (Applied Biosystems). Sequencing is then carried out using either 
the ABI 373 or 377 DNA sequencing system (Applied Biosystems), the MEGAB ACE 1000 DNA 
5 sequencing system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The 
resulting sequences are analyzed using a variety of algorithms which are well known in the art. 
(See, e.g., Ausubel, RM. (1997) Short Protocols in Molecular Biology . John Wiley & Sons, New 
York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and Biotechnology . Wiley VCH, New 
York NY, pp. 856-853.) 

10 The nucleic acid sequences encoding TRICH may be extended utilizing a partial nucleotide 

sequence and employing various PCR-based methods known in the art to detect upstream 
sequences, such as promoters and regulatory elements. For example, one method which may be ■ ' ' 
employed, restriction-site PCR, uses universal and nested primers to amplify unknown sequence 
from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 

15 2:3 18-322.) Another method, inverse PCR, uses primers that extend in divergent directions to 
amplify unknown sequence from a circularized template. The template is derived from restriction 
fragments comprising a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et 
al. (1988) Nucleic Acids Res. 16:8186.) A third method, capture PCR, involves PCR amplification 
of DNA fragments adjacent to known sequences in human and yeast artificial chromosome DNA. 

20 (See, e.g., Lagerstrom, M. et al (1991) PCR Methods Applic. 1:111-1 19.) In this method, multiple 
restriction enzyme digestions and ligations may be used to insert an engineered double-stranded 
sequence into a region of unknown sequence before performing PCR. Other methods which may be 
used to retrieve unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) 
Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR/nested primers, and 

25 PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic DNA. This procedure 
avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based 
methods, primers may be designed using commercially available software, such as OLIGO 4.06 
primer analysis software (National Biosciences, Plymouth MN) or another appropriate program, to 
be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal 

30 to the template at temperatures of about 68°C to 72°C. 

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of 

35 sequence into 5' non-transcribed regulatory regions. 
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Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
5 emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

10 In another embodiment of the invention, polynucleotide sequences or fragments thereof 

which encode TRICH may be cloned in recombinant DNA molecules that direct expression of 
TRICH, or fragments or functional equivalents thereof, in appropriate host cells. Due to the 
inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same 
or a functionally equivalent amino acid sequence may be produced and used to express TRICH. 

15 The nucleotide sequences of the present invention can be engineered using methods 

generally known in the art in order to alter TRICH-encoding sequences for a variety of purposes 
including, but not limited to, modification of the cloning, processing, and/or expression of the gene 
product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments and 
synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, 

20 oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create 
new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, 
and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques 
such as MOLECUL ARB REEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 

25 5,837,458; Chang, C.-C et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C et al. (1999) 
Nat. Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
improve the biological properties of TRICH, such as its biological or enzymatic activity or its 
ability to bind to other molecules or compounds. DNA shuffling is a process by which a library of 
gene variants is produced using PCR-mediated recombination of gene fragments. The library is 

30 then subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds 
of DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing 
random point mutations may be recombined, screened, and then reshuffled until the desired 

35 properties are optimized. Alternatively, fragments of a given gene may be recombined with 
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fragments of homologous genes in the same gene family, either from the same or different species, 
thereby maximizing the genetic diversity of multiple naturally occurring genes in a directed and 
controllable manner. 

In another embodiment, sequences encoding TRICH may be synthesized, in whole or in 
5 part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et aL (1980) 
Nucleic Acids Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 
7:225-232.) Alternatively, TRICH itself or a fragment thereof may be synthesized using chemical 
methods. For example, peptide synthesis can be performed using various solution-phase or 
solid-phase techniques. (See, e.g., Creighton, T. (1984^ Proteins, Structures and Molecular 

10 Properties, WH Freeman, New York NY, pp. 55-60; and Roberge, J.Y. et al. (1995) Science 
269:202-204.) Automated synthesis may be achieved using the ABI 431 A peptide synthesizer 
(Applied Biosystems). Additionally, the amino acid sequence of TRICH, or any part thereof, may - 
be altered during direct synthesis and/or combined with sequences from other proteins, or any part 
thereof, to produce a variant polypeptide or a polypeptide having a sequence of a naturally occurring 

15 polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by 
sequencing. (See, e.g., Creighton, supra , pp. 28-53.) 

20 In order to express a biologically active TRICH, the nucleotide sequences encoding TRICH 

or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which 
contains the necessary elements for transcriptional and translational control of the inserted coding 
sequence in a suitable host. These elements include regulatory sequences, such as enhancers, 
constitutive and inducible promoters, and 5' and 3* untranslated regions in the vector and in 

25 polynucleotide sequences encoding TRICH. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding TRICH. Such signals include the ATG initiation codon and adjacent 
sequences, e.g. the Kozak sequence. In cases where sequences encoding TRICH and its initiation 
codon and upstream regulatory sequences are inserted into the appropriate expression vector, no 

30 additional transcriptional or translational control signals may be needed. However, in cases where 
only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals 
including an in-frame ATG initiation codon should be provided by the vector. Exogenous 
translational elements and initiation codons may be of various origins, both natural and synthetic: 
The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the 

35 particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
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20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding TRICH and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA techniques, 
5 synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) 
Molecular Cloning. A Laboratory Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 
16-17; Ausubel, F.M. et al. (1995) Current Protocols in Molecular Biology . John Wiley & Sons, 
New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express 

10 sequences encoding TRICH. These include, but are not limited to, microorganisms such as bacteria 
transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected with viral expression vectors 
(e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., cauliflower 
mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti 

15 or pBR322 plasmids); or animal cell systems. (See, e.g., Sambrook, supra ; Ausubel, supra : Van 
Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhardi E.K. et al. (1994) 
Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7: 1937-1945; 
Takamatsu, N. (1987) EMBO J. 6:307-311; The McGraw Hill Yearbook of Science and Technology 
(1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. 

20 Sci. USA 81:3655-3659; and Harrington, JJ. et al. (1997) Nat. Genet. 15:345-355.) Expression 
vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various 
bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ, tissue, or 
cell population. (See, e.g., Di Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et 
al. (1993) Proc. Nad. Acad. Sci. USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 

25 317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 31(3):219-226; and Verma, LM. 
and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding TRICH. For example, routine 
cloning, subcloning, and propagation of polynucleotide sequences encoding TRICH can be achieved 

30 using a multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or 

PSPORT1 plasmid (Life Technologies). Ligation of sequences encoding TRICH into the vector's 
multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for 
identification of transformed bacteria containing recombinant molecules. In addition, these vectors 
may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, 

35 and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. 
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Schuster (1989) J. Biol. Chem. 264:5503-5509.) When large quantities of TRICH are needed, e.g. 
for the production of antibodies, vectors which direct high level expression of TRICH may be used. 
For example, vectors containing the strong, inducible SP6 or T7 bacteriophage promoter may be 
used. 

5 Yeast expression systems may be used for production of TRICH. A number of vectors 

containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable 
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 

10 1995, supra ; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. 
(1994) Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of TRICH. Transcription of sequences 
encoding TRICH may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence fromTMV (Takamatsu, N. (1987) EMBO 

15 J. 6:307-31 1). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et ' 
al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) 
These constructs can be introduced into plant cells by direct DNA transformation or 
"■'pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology 

20 (1992) McGraw Hill, New York NY, pp. 19M96.) 

In mammalian cells, a number of viral-based expression systems may be utilized In cases 
where an adenovirus is used as an expression vector, sequences encoding TRICH may be ligated 
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to 

25 obtain infective virus which expresses TRICH in host cells. (See, e.g., Logan, J. and T. Shenk 

(1984) Proc. Natl. Acad. Set USA 81:3655-3659.) In addition, transcription enhancers, such as the 
Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. 
SV40 or EBV-based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 

30 DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, JJ. et al. (1997) Nat. Genet. 
15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression 
35 of TRICH in cell lines is preferred. For example, sequences encoding TRICH can be transformed 
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into cell lines using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vector. 
Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in 
enriched media before being switched to selective media. The purpose of the selectable marker is to 
5 confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
10 phosphoribosyltransferase genes, for use in tk and apf cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:S17-823.) Also, antimetabolite, antibiotic, 
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance*' 
to methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and 
pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 
15 Wigler, M. et al. (1980) Proc. Natl. Acad. ScL USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mol. Biol. 150: 1-14.) Additional selectable genes have been described, e.g., trpB and hisD y 
which alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan 
(1988) Proc. Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green 
fluorescent proteins (GFP; Clontech), 6 glucuronidase and its substrate B-glucuronide, or luciferase 
and its substrate luciferin may be used. These markers can be used not only to identify 
transformants, but also to quantify the amount of transient or stable protein expression attributable 
to a specific vector system. (See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest 
is also present, the presence and expression of the gene may need to be confirmed. For example, if 
the sequence encoding TRICH is inserted within a marker gene sequence, transformed cells 
containing sequences encoding TRICH can be identified by the absence of marker gene function. 
Alternatively, a marker gene can be placed in tandem with a sequence encoding TRICH under the 
control of a single promoter. Expression of the marker gene in response to induction or selection 
usually indicates expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding TRICH and that 
express TRICH may be identified by a variety of procedures known to those of skill in the art. 
These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, 
or chip based technologies for the detection and/or quantification of nucleic acid or protein 
sequences. 
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Immunological methods for detecting and measuring the expression of TRICH using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
5 monoclonal antibodies reactive to two non-interfering epitopes on TRICH is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art. 
(See, e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual , APS Press, St. Paul 
MN, Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology , Greene Pub. Associates 
and Wiley-Intersctence, New York NY; and Pound, J.D. (1998) Immunochemical Protocols . 
10 Humana Press, Totowa N J.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art 
and may be.used in-various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding TRICH 
include oligolabchng. nick translation, end-labeling, or PCR amplification using a labeled 
15 nucleotide. Alternatively, the sequences encoding TRICH, or any fragments thereof, may be cloned 
into a vector for the production of an mRNA probe. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by addition of an 
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures 
may be conducted using a variety of commercially available kits, such as those provided by 
20 Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter 
molecules or labels which may be used for ease of detection include radionuclides, enzymes, 
fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TRICH may be cultured under 
25 conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained uitracellularly depending on. the 
sequence and/or the vector used. As will be understood by those of skill in the art, expression 
vectors containing polynucleotides which encode TRICH may be designed to contain signal 
sequences which direct secretion of TRICH through a prokaryotic or eukaryotic cell membrane. 
30 In addition, a host cell strain may be chosen for its ability to modulate expression of the 

inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, 'lipidation, and acylation. Post-translational processing which cleaves a "prepro" 
or "pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 
35 Different host cells which have specific cellular machinery and characteristic mechanisms for 
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postradiational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

Li another embodiment of the invention, natural, modified, or recombinant nucleic acid 
5 sequences encoding TRICH may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric TRICH protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of TRICH activity. Heterologous protein 
and peptide moieties may also facilitate purification of fusion proteins using commercially available 
10 affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, 
FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of ' 
their cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, 
and metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable 
15 immunoaffinity purification of fusion proteins using commercially available monoclonal and 
polyclonal antibodies that specifically recognize these epitope tags. A fusion protein may also be 
engineered to contain a proteolytic cleavage site located between the TRICH encoding sequence and 
the heterologous protein sequence, so that TRICH may be cleaved away from the heterologous 
moiety following purification. Methods for fusion protein expression and purification are discussed 
20 in Ausubel (1995, supra, ch. 10). A variety of commercially available kits may also be used to 
facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled TRICH may be achieved 
invitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with 
25 the T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, 3s S-methionine. 

TRICH of the present invention or fragments thereof may be used to screen for compounds 
that specifically bind to TRICH. At least one and up to a plurality of test compounds may be 
screened for specific binding to TRICH. Examples of test compounds include antibodies, 
30 oligonucleotides, proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
TRICH, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or 
a natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) Current Protocols in Imm.mnlnPY 
l(2):.Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which 
35 TRICH binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, 
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the compound can be rationally designed using known techniques. In one embodiment, screening 
for these compounds involves producing appropriate cells which express TRICH, either as a 
secreted protein or on the cell membrane. Preferred cells include cells from mammals, yeast, 
Drosophila , or E. coli . Cells expressing TRICH or cell membrane fractions which contain TRICH 

5 are then contacted with a test compound and binding, stimulation, or inhibition of activity of either 
TRICH or the compound is analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label For example, 
the assay may comprise the steps of combining at least one test compound with TRICH, either in 

10 solution or affixed to a solid support, and detecting the binding of TRICH to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, ' 
chemical libraries, or natural product mixtures, and the test compound(s) may be free in solution or 
affixed to a solid support. 

15 TRICH of the present invention or fragments thereof may be used to screen for compounds 

that modulate the activity of TRICH. Such compounds may include agonists, antagonists, or partial 
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 
TRICH activity, wherein TRICH is combined with at least one test compound, and the activity of 
TRICH in the presence of a test compound is compared with the activity of TRICH in the absence 

20 of the test compound. A change in the activity of TRICH in the presence of the test compound is 
indicative of a compound that modulates the activity of TRICH. Alternatively, a test compound is 
combined with an in vitro or cell-free system comprising TRICH under conditions suitable' for 
TRICH activity, and the assay is performed. In either of these assays, a test compound which 
modulates the activity of TRICH may do so indirectly and need not come in direct contact with the 

25 test compound. At least one and up to a plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding TRICH or their mammalian homologs 
may be "knocked out" in an animal model system using homologous recombination in embryonic 
stem (ES) cells. Such techniques are well known in the art and are useful for the generation of 
animal models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 

30 5,767,337.) For example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the 
early mouse embryo andgrown in culture. The ES cells are transformed with a vector containing 
the gene of interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; 
Capecchi, M.R. (1989) Science 244: 1288-1292). The vector integrates into the corresponding 
region of the host genome by homologous recombination. Alternatively, homologous 

35 recombination takes place using the Cre-loxP system to knockout a gene of interest in a tissue- or 
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developmental stage-specific manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. 
et al. (1997) Nucleic Acids Res. 25:4323-4330). Transformed ES cells are identified and 
microinjected into mouse cell blastocysts such as those from the C57BL/6 mouse strain. The 
blastocysts are surgically transferred to pseudopregnant dams, and the resulting chimeric progeny 
5 are genotyped and bred to produce heterozygous or homozygous strains. Transgenic animals thus 
generated may be tested with potential therapeutic or toxic agents. 

Polynucleotides encoding TRICH may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate 
cell lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages 
10 differentiate into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, 
J.A. et al. (199S) Science 282:1145-1147). 

Polynuclcot ides encoding TRICH can also be used to create "knockin" humanized animals ' 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding TRICH is injected into animal ES cells, and the injected 
15 sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and 
the blastulae arc implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress TRICH, e.g., by secreting TRICH in its milk, may 
also serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 
20 4:55-74). 

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of TRICH and transporters and ion channels. In addition, examples of tissues 
expressing TRICH can be found in Table 6. Therefore, TRICH appears to play a role in transport, 

25 neurological, muscle, immunological and cell proliferative disorders. In the treatment of disorders 
associated with increased TRICH expression or activity, it is desirable to decrease the expression or 
activity of TRICH. In the treatment of disorders associated with decreased TRICH expression or 
activity, it is desirable to increase the expression or activity of TRICH. 

Therefore, in one embodiment, TRICH or a fragment or derivative thereof may be 

30 administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder 
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's 
muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes meUitus, diabetes 
insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, 

35 normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug 
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resistance, myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, 
peripheral neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with 
transport, e.g., angina, bradyarrythniia, tachyarrythmia, hypertension, Long QT syndrome, 
myocarditis, cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, 

5 mitochondrial myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body 
myositis, infectious myositis, polymyositis, neurological disorders associated with transport, e.g., 
Alzheimer's disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, 
paranoid psychoses, and schizophrenia, and other disorders associated with transport, e.g., 
neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, 

10 Wilson's disease, cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, 
hyperglycemia, hypoglycemia, Grave's disease, goiter, Cushing's disease, Addison's disease, 
glucose-galactose malabsorption syndrome, glycogen storage disease, hypercholesterolemia, 
adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn syndrome, von Gierke 
disease, pseudohypoaldosteronism type 1, Liddle's syndrome, cystinuria, iminoglycinuria, Hartup 

15 disease, Fanconi disease, and Bartter syndrome; a neurological disorder such as epilepsy, ischemic 
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 

20 diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann- 
Straussier-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 

25 encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and 
other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and 
polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic 

30 paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, seasonal affective 
disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, hemiplegic migraine, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a muscle 
disorder such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's muscular 

35 dystrophy, myotonic dystrophy, central core disease, nemaline myopathy, centronuclear myopathy, 
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lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis, deimatomyositis, 
inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, anaphylactic shock, 
arrhythmias, asthma, cardiovascular shock, Gushing' s syndrome, hypertension, hypoglycemia, 
myocardial infarction, migraine, pheochromocytoma, and myopathies including encephalopathy, 
5 epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, ophthalmoplegia, acid 
maltase deficiency (AMD, also known as Pompe's disease), generalized myotonia, and myotonia 
congenita; an immunological disorder such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, 
amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune 

10 thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodennal dystrophy (APECED), 
bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis ' 
fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, 
Graves' disease, Hashimoto's thyroiditis, bypereosinophilia, irritable bowel syndrome, multiple 

15 sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 

20 helminthic infections, and trauma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 
primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 

25 bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 
thymus, thyroid, and uterus. 

In another embodiment, a vector capable of expressing TRICH or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 

30 expression or activity of TRICH including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified TRICH in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or 
prevent a disorder associated with decreased expression or activity of TRICH including, but not 
limited to, those provided above. 

35 In still another embodiment, an agonist which modulates the activity of TRICH may be 
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administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH including, but not limited to, those listed above. 

In a further embodiment, an antagonist of TRICH may be administered to a subject to treat 
or prevent a disorder associated with increased expression or activity of TRICH. Examples of such 
5 disorders include, but are not limited to, those transport, neurological, muscle, immunological and 
cell proliferative disorders described above. In one aspect, an antibody which specifically binds 
TRICH may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for 
bringing a pharmaceutical agent to cells or tissues which express TRICH. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 

10 encoding TRICH may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of TRICH including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 

15 by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 

combination of therapeutic agents may act synergistically to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of TRICH may be produced using methods which are generally known in the 

20 art. In particular, purified TRICH may be used to produce antibodies or to screen libraries of 

pharmaceutical agents to identify those which specifically bind TRICH. Antibodies to TRICH may 
also be generated using methods that are well known in the art. Such antibodies may include, but 
are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, 
and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which 

25 inhibit dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with TRICH or with any fragment or oligopeptide 
thereof which has immunogenic properties. Depending on the host species, various adjuvants may 
be used to increase immunological response. Such adjuvants include, but are not limited to, 

30 Freund's, mineral gels such as aluminum hydroxide, and surface active substances such as 

lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among 
adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corvnebacterium parvum are 
especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
35 TRICH have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
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consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are identical to a portion of the amino acid sequence of the natural protein. Short 
stretches of TRICH amino acids may be fused with those of another protein, such as KLH, and 
antibodies to the chimeric molecule may be produced. 
5 Monoclonal antibodies to TRICH may be prepared using any technique which provides for 

the.production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV- 
hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. 
(1985) J. Immunol. Methods 81:31^2; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 

10 80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 62: 109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate ' 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and 

15 Takeda, S. et al. (1985) Nature 3 14:452-454.) Alternatively, techniques described for the 

production of single chain antibodies may be adapted, using methods known in the art, to produce 
TRICH-specific single chain antibodies. Antibodies with related specificity, but of distinct 
idiotypic composition, may be generated by chain shuffling from random combinatorial 
immunoglobulin libraries. (See, e.g., Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134- 
20 10137.) 

Antibodies may also be produced by inducing in_yjyo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

25 Antibody fragments which contain specific binding sites for TRICH may also be generated. 

For example, such fragments include, but are not limited to, F(ab') 2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 

30 et al. (1989) Science 246: 1275-1281.) 

Various immunoassays may be used for screening.to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between TRICH and its 

35 specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
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reactive to two non-interfering TRICH epitopes is generally used, but a competitive binding assay 
may also be employed (Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for TRICH. Affinity is expressed as an 

5 association constant, K,, which is defined as the molar concentration of TRICH-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. 
The K a determined for a preparation of polyclonal antibodies, which are heterogeneous in their 
affinities for multiple TRICH epitopes, represents the average affinity, or avidity, of the antibodies 
for TRICH. The K, determined for a preparation of monoclonal antibodies, which are monospecific 

10 for a particular TRICH epitope, represents a true measure of affinity. High-affinity antibody 
preparations with K a ranging from about 10 9 to 10 12 L/mole are preferred for use in immunoassays 
in which the TRICH-antibody complex must withstand rigorous manipulations. Low-affinity •? 
antibody preparations with ranging from about 10 6 to 10 7 L/mole are preferred for use in 
immunopurification and similar procedures which ultimately require dissociation of TRICH, 

15 preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical 
Approach , IRL Press, Washington DC; Liddell, J.E. and A. Oyer (1991) A Practical Guide to 
Monoclonal Antibodies . lohn Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
determine the quality and suitability of such preparations for certain downstream applications. For 

20 example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring 
precipitation of TRICH-antibody complexes. Procedures for evaluating antibody specificity, titer, 
and avidity, and guidelines for antibody quality and usage in various applications, are generally 
available. (See, e.g., Catty, supra , and Coligan et al. supra .) 

25 In another embodiment of the invention, the polynucleotides encoding TRICH, or any 

fragment or complement thereof, may be used for therapeutic purposes. In one aspect, 
modifications of gene expression can be achieved by designing complementary sequences or 
antisense molecules (DNA, RNA, PNA, or modified oligonucleotides) to the coding or regulatory 
regions of the gene encoding TRICH. Such technology is well known in the art, and antisense 

30 oligonucleotides or larger fragments can be designed from various locations along the coding or 
control regions of sequences encoding TRICH. (See, e.g., Agrawal, S., ed. (1996) Antisense 
Therapeutics . Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 

35 intracellular^ in the form of an expression plasmid which, upon transcription, produces a sequence 
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complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13): 1288-1296.) Antisense sequences can also be introduced intracellularly through the use of 
viral vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) 
5 Blood 76:271; Ausubel, supra: Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323- 
347.) Other gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, 
and other systems known in the art. (See, e.g., Rossi, JJ. (1995) Br. Med. Bull. 51(l):217-225; 
Boado, RJ. et al. (1998) J. Pharm. Sci. 87(11): 1308-1315; and Morris, M.C et al. (1997) Nucleic 
Acids Res. 25(14):2730-2736.) 
10 In another embodiment of the invention, polynucleotides encoding TRICH may be used for 

somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic 
deficiency (e.g., in the cases of severe combined immunodeficiency (SCID)-Xl disease ' 
characterized by X-linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), 
severe combined immunodeficiency syndrome associated with an inherited adenosine deaminase 
15 (ADA) deficiency (Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) 
Science 270:470-475), cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. 
(1995) Hum. Gene Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), 
thalassamias, familial hypercholesterolemia, and hemophilia resulting from Factor Vm or Factor K 
deficiencies (Crystal, R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 
20 389:239-242)), (ii) express a conditionally lethal gene product (e.g., in the case of cancers which 
result from unregulated cell proliferation), or (iii) express a protein which affords protection against 
intracellular parasites (e.g., against human retroviruses, such as human immunodeficiency virus 
(HIV) (Baltimore, D. (1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93: 1 1395-1 1399), hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida 
25 albicans and Paracoccidioides brasiliensis: and protozoan parasites such as Plasmodium falciparum 
and Trypanosoma cruzi). Li the case where a genetic deficiency in TRICH expression or regulation 
causes disease, the expression of TRICH from an appropriate population of transduced cells may 
alleviate the clinical manifestations caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
30 TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing 
these vectors by mechanical means into TRICH-deficient cells. Mechanical transfer technologies 
for use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 
35 Biochem. 62: 191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. 
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Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of TRICH include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors 
(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla 
CA), and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). 
TRICH may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus 
(CMV), Rous sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or P-actin genes), (ii) an 
inducible promoter (e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) 
Proc. Natl. Acad. Sci. USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, 
F.M.V. and H.M. Blau (1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the 
T-REX plasmid (Invitrogen)); the ecdysone-inducible promoter (available in the plasmids PVGRXR 
and PIND; Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone ' 
inducible promoter (Rossi, F.M.V. and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the 
native promoter of the endogenous gene encoding TRICH from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSACTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1:841.-845). The introduction of DNA to primary cells requires modification of 
these standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects 
with respect to TRICH expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding TRICH under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus as-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropismfor 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, 
R. et al. (1998) J. Virol. 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ('Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 
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discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 
reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T- 
cells), and the return of transduced cells to a patient are procedures well known to persons skilled in 
the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020- 
5 7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; 
Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283- 
2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to cells which have one or more genetic abnormalities with 
10 respect to the expression of TRICH. The construction and packaging of adenovirus-based vectors 
are well known to those with ordinary skill in the art. Replication defective adenovirus vectors have 
proven to be versatile for importing genes encoding immunoregulatory proteins into intact islets iiv 
the pancreas (Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral 
vectors are described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene 
15 therapy"), hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. 
(1999) Annu. Rev. Nutr. 19:511-544 and Verma, LM. and N. Somia (1997) Nature 18:389:239-242, 
both incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to target cells which have one or more genetic abnormalities with 
20 respect to the expression of TRICH. The use of heipes simplex virus (HSV)-based vectors may be 
especially valuable for introducing TRICH to cells of the central nervous system, for which HSV 
has a tropism. The construction and packaging of herpes-based vectors are well known to those 
with ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based 
vector has been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. 
Eye Res. 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail 
in U.S. Patent No. 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV 
d92 which consists of a genome containing at least one exogenous gene to be transferred to a cell 
under the control of the appropriate promoter for purposes including human gene therapy. Also 
taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, 
ICP27 and ICP22. For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, 
H. et al. (1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of 
cloned herpesvirus sequences, the generation of recombinant virus following the transfection of 
multiple plasmids containing different segments of the large herpesvirus genomes, the growth and 
propagation of herpesvirus, and the infection of cells with herpesvirus are techniques well known to 
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those of ordinary skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding TRICH to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 

5 on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA, 
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic 
activity (e.g., protease and polymerase). Similarly, inserting the coding sequence for TRICH into 

10 the alphavirus genome in place of the capsid-coding region results in the production of a large 
number of TRICH -coding RNAs and the synthesis of high levels of TRICH in vector transduced 
cells. While alphavirus infection is typically associated with cell lysis within a few days, the ability 
to establish a persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis 
virus (SIN) indicates that the lytic replication of alphaviruses can be altered to suit the needs of the 

15 gene therapy application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of 
alphaviruses will allow the introduction of TRICH into a variety of cell types. The specific 
transduction of a subset of cells in a population may require the sorting of cells prior to 
transduction. The methods of manipulating infectious cDNA clones of alphaviruses, performing 
alphavirus cDNA and RNA transfections, and performing alphavirus infections, are well known to 

20 those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, 
inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is 
useful because it causes inhibition of the ability of the double helix to open sufficiently for the 

25 binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances 
using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, 
B.E. and B.I. Carr, Molecular and Immunologic Approaches . Futura Publishing, Mt. Kisco NY, pp. 
163-177.) A complementary sequence or antisense molecule may also be designed to block 
translation of mRNA by preventing the transcript from binding to ribosomes. 

30 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage 

of RNA. The mechanism of ribozyme acdon involves sequence-specific hybridization of the 
ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. For 
example, engineered hammerhead motif ribozyme molecules may specifically and efficiently 
catalyze endonucleolytic cleavage of sequences encoding TRICH. 

35 Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
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scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
5 candidate targets may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of nucleic acid molecules. These include 
techniques for chemically synthesizing oligonucleotides such as solid phase phosphoramidite 
10 chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in vivo 

transcription of DNA sequences encoding TRICH. Such DNA sequences may be incorporated into 
a wide variety of vectors with suitable RNA polymerase promoters such as T7 or SP6. ' 
Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or 
inducibly, can be introduced into cell lines, cells, or tissues. 

15 RNA molecules may be modified to increase intracellular stability and half-life. Possible 

modifications include, but are not limited to, the addition of flanking sequences at the 5* and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2'O-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as 

20 inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms 
of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by 
endogenous endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TRICH. 

25 Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors 

30 or promoters of polynucleotide expression. Thus, in the treatment of disorders associated with 
increased TRICH expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding TRICH may be therapeutically useful, and in the treatment of disorders 
associated with decreased TRICH expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding TRICH may be therapeutically useful. 

35 At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
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altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective 
in altering polynucleotide expression; selection from an existing, commercially-available or 
proprietary library of naturally-occurring or non-natural chemical compounds; rational design of a 
5 compound based on chemical and/or structural properties of the target polynucleotide; and selection 
from a library of chemical compounds created combinatorially or randomly. A sample comprising 
a polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. The 
sample may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or 
reconstituted biochemical system. Alterations in the expression of a polynucleotide encoding 

10 TRICH are assayed by any method commonly known in the art. Typically, the expression of a 
specific nucleotide is detected by hybridization with a probe having a nucleotide sequence 
complementary to the sequence of the polynucleotide encoding TRICH. The amount of ' 
hybridization may be quantified, thus forming the basis for a comparison of the expression of the 
polynucleotide both with and without exposure to one or more test compounds. Detection of a 

15 change in the expression of a polynucleotide exposed to a test compound indicates that the test 
compound is effective in altering the expression of the polynucleotide. A screen for a compound 
effective in altering expression of a specific polynucleotide can be carried out, for example, using a 
Schizosaccharomvces pombe gene expression system (Atkins, D. et al. (1999) U.S. Patent No. 
5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids Res. 28:E15) or a human cell line such as HeLa 

20 cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. Commun. 268:8-13). A particular 

embodiment of the present invention involves screening a combinatorial library of oligonucleotides 
(such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and modified 
oligonucleotides) for antisense activity against a specific polynucleotide sequence (Bruice, T.W. et 
al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 6,022,691). 

25 Many methods for introducing vectors into cells or tissues are available and equally suitable 

for use in vivo , in vitro, and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be 
achieved using methods which are well known in the art (See, e.g., Goldman, C.K. et al. (1997) 

30 Nat. Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, 
and monkeys. 

An additional embodiment of the invention relates to the administration of a composition 
35 which generally comprises an active ingredient formulated with a pharmaceutically acceptable 
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excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. 
Various formulations are commonly known and are thoroughly discussed in the latest edition of 
Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may 
consist of TRICH, antibodies to TRICH, and mimetics, agonists, antagonists, or inhibitors of 
5 TRICH. 

The compositions utilized in this invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

10 Compositions for pulmonary administration may be prepared in liquid or dry powder form. 

These compositions arc generally aerosolized immediately prior to inhalation by the patient. In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of ■ ' 
fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger 
peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar 

15 region of the lung have enabled the practical delivery of drugs such as insulin to blood circulation 
(see, e.g., Patton, J.S. et ah. U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of 
administration without needle injection, and obviates the need for potentially toxic penetration 
enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
20 ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising TRICH or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of 
25 the macromolecule. Alternatively, TRICH or a fragment thereof may be joined to a short cationic 
N-terminal portion from the HTV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. 
et al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
30 culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, 

monkeys, or pigs. An animal model may also be used to determine the appropriate concentration 
range and route of administration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
35 TRICH or fragments thereof, antibodies of TRICH, and agonists, antagonists or inhibitors of 
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TRICH, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, 
such as by calculating the ED 50 (the dose therapeutically effective in 50% of the population) or LD 50 
(the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is 
5 the therapeutic index, which can be expressed as the LD 5( /ED 50 ratio. Compositions which exhibit 
large therapeutic indices are preferred. The data obtained from cell culture assays and animal 
studies are used to formulate a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that includes the ED 50 with 
little or no toxicity. The dosage varies within this range depending upon the dosage form employed, 

10 the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of ' 
the active moiety or to maintain the desired effect. Factors which may be taken into account include 
the severity of the disease state, the general health of the subject, the age, weight, and gender of the 

15 subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 

response to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, 
or biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 to 100,000 /.*g, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 

20 methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

25 In another embodiment, antibodies which specifically bind TRICH may be used for the 

diagnosis of disorders characterized by expression of TRICH, or in assays to monitor patients being 
treated with TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for TRICH include methods which utilize the antibody and a label to detect 

30 TRICH in human body fluids or in extracts of cells or tissues. The antibodies may be used with or 
without modification, and may be labeled by covalent or non-covalent attachment of a reporter 
molecule. A wide variety of reporter molecules, several of which are described above, are known in 
the art and may be used. 

' A variety of protocols for measuring TRICH, including ELISAs, RIAs, and FACS, are 

35 known in the art and provide a basis for diagnosing altered or abnormal levels of TRICH expression. 
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Normal or standard values for TRICH expression are established by combining body fluids or cell 
extracts taken from normal mammalian subjects, for example, human subjects, with antibodies to 
TRICH under conditions suitable for complex formation. The amount of standard complex 
formation may be quantitated by various methods, such as photometric means. Quantities of 
5 TRICH expressed in subject, control, and disease samples from biopsied tissues are compared with 
the standard values. Deviation between standard and subject values establishes the parameters for 
diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding TRICH may be used 
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide 
10 sequences, complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used 
to detect and quantify gene expression in biopsied tissues in which expression of TRICH may be 
correlated with disease. The diagnostic assay may be used to determine absence, presence, and •? 
excess expression of TRICH, and to monitor regulation of TRICH levels during therapeutic 
intervention. 

15 In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 

sequences, including genomic sequences, encoding TRICH or closely related molecules may be 
used to identify nucleic acid sequences which encode TRICH. The specificity of the probe, whether 
it is made from a highly specific region, e.g., the 5* regulatory region, or from a less specific region, 
e.g., a conserved motif, and the stringency of the hybridization or amplification will determine 
20 whether the probe identifies only naturally occurring sequences encoding TRICH, allelic variants, 
or related sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the TRICH encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:33-64 or 
from genomic sequences including promoters, enhancers, and introns of the TRICH gene. 

Means for producing specific hybridization probes for DNAs encoding TRICH include the 
cloning of polynucleotide sequences encoding TRICH or TRICH derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding TRICH may be used for the diagnosis of disorders 
associated with expression of TRICH. Examples of such disorders include, but are not limited to, a 
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic 
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fibrosis, Becker's muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, 
diabetes insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic 
paralysis, normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug 
resistance, myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, 
5 peripheral neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with 
transport, e.g., angina, bradyarrythmia, tachyarrythmia, hypertension, Long QT syndrome, 
myocarditis, cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, 
mitochondrial myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body 
myositis, infectious myositis, polymyositis, neurological disorders associated with transport, e.g., 

10 Alzheimer's disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, 
paranoid psychoses, and schizophrenia, and other disorders associated with transport, e.g., % 
neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, ' 
Wilson's disease, cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, 
hyperglycemia, hypoglycemia, Grave's disease, goiter, Cushing's disease, Addison's disease, 

15 glucose-galactose malabsorption syndrome, glycogen storage disease, hypercholesterolemia, 

adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn syndrome, von Gierke 
disease, pseudohypoaldosteronism type 1, Liddle's syndrome, cystinuria, iminoglycinuria, Hartup 
disease, Fanconi disease, and Bartter syndrome; a neurological disorder such as epilepsy, ischemic 
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 

20 Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 

25 disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann : 

Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebeUoretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 

30 nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and 
other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and 
polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic 
paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, seasonal affective 
disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, hemipiegic migraine, tardive 

35 dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
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supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a muscle 
disorder such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's muscular 
dystrophy, myotonic dystrophy, central core disease, nemaline myopathy, centronuclear myopathy, 
lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis, dermatomyositis, 
5 inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, anaphylactic shock, 
arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension, hypoglycemia, 
myocardial infarction, migraine, pheochromocytoma, and myopathies including encephalopathy, 
epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, ophthalmoplegia, acid 
maliase deficiency (AMD, also known as Pompe's disease), generalized myotonia, and myotonia 

10 congenita; an immunological disorder such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, 
amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune ' 
thyroiditis, autoimmune polyendocrinopatliy-candidiasis-ectodermal dystrophy (APECED), 
bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 

15 diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis 
fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, 
Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple 
sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 

20 Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 

25 (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 

primary thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 
bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 

30 thymus, thyroid, and uterus. The polynucleotide sequences encoding TRICH may be used in 
Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR 
technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing 
fluids or tissues from patients to detect altered TRICH expression. Such qualitative or quantitative 
methods are well known in the art. 

35 In a particular aspect, the nucleotide sequences encoding TRICH may be useful in assays 
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that detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding TRICH may be labeled by standard methods and added to a fluid or tissue 
sample from a patient under conditions suitable for the formation of hybridization complexes. After 
a suitable incubation period, the sample is washed and the signal is quantified and compared with a 
5 standard value. If the amount of signal in the patient sample is significantly altered in comparison 
to a control sample then the presence of altered levels of nucleotide sequences encoding TRICH in 
the samplcindicates the presence of the associated disorder. Such assays may also be used to 
evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical 
trials, or to monitor the treatment of an individual patient. 

10 In order to provide a basis for the diagnosis of a disorder associated with expression of 

TRICH, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a <> 
sequence, or a fragment thereof, encoding TRICH, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 

15 normal subjects with values from an experiment in which a known amount of a substantially 
purified polynucleotide is used. Standard values obtained in this manner may be compared with 
values obtained from samples from patients who are symptomatic for a disorder. Deviation from 
standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 

20 hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained 
from successive assays may be used to show the efficacy of treatment over a period ranging from 
several days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 

25 overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the 
appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow health 
professionals to employ preventative measures or aggressive treatment earlier thereby preventing 
the development or further progression of the cancer. 

30 Additional diagnostic uses for oligonucleotides designed from the sequences encoding 

TRICH may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a 
polynucleotide encoding TRICH, or a fragment of a polynucleotide complementary to the 
polynucleotide encoding TRICH, and will be employed under optimized conditions for 

35 identification of a specific gene or condition. Oligomers may also be employed under less stringent 
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conditions for detection or quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding TRICH may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic 
5 disease in humans. Methods of SNP detection include, but are not limited to, single-stranded 
conformation polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, 
oligonucleotide primers derived from the polynucleotide sequences encoding TRICH are used to 
amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, 
from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause 

10 differences in the secondary and tertiary structures of PCR products in single-stranded form, and 
these differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the 
oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in high-^ 
throughput equipment such as DNA sequencing machines. Additionally, sequence database 
analysis methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by 

15 comparing the sequence of individual overlapping DNA fragments which assemble into a common 
consensus sequence. These computer-based methods filter out sequence variations due to 
laboratory preparation of DNA and sequencing errors using statistical models and automated 
analyses of DNA sequence chromatograms. In the alternative, SNPs may be detected and 
characterized by mass spectrometry using, for example, the high throughput MASSARRAY system 

20 (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of TRICH include radiolabeling 
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 

25 accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide 
of interest is presented in various dilutions and a spectrophotometric or colorimetric response gives 
rapid quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microairay. The 

30 microairay can be used in transcript imaging techniques which monitor the relative expression 
levels of large numbers of genes simultaneously as described below. The microarray may also be 
used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to 
monitor progression/regression of disease as a function of gene expression, and to develop and 

35 monitor the activities of therapeutic agents in the treatment of disease. In particular, this 
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information may be used to develop a pharmacogenomic profile of a patient in order to select the 
most appropriate and effective treatment regimen for that patient. For example, therapeutic agents 
which are highly effective and display the fewest side effects may be selected for a patient based on 
his/her pharmacogenomic profile. 
5 In another embodiment, TRICH, fragments of TRICH, or antibodies specific for TRICH 

may be used as elements on a microarray. The microarray may be used to monitor or measure 
protein-protein interactions, drug-target interactions! and gene expression profiles, as described 
above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 

10 generate a transcript image of a tissue or cell type. A transcript image represents the global pattern 
of gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed 
by quantifying the number of expressed genes and their relative abundance under given conditions ■ ' 
and at a given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent 
No. 5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be 

15 generated by hybridizing the polynucleotides of the present invention or their complements to the 
totality of transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, 
the hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

20 Transcript images may be generated using transcripts isolated from tissues, cell lines, 

biopsies,, or other biological samples. The transcript image may thus reflect gene expression in 
vivo , as in the case of a tissue or biopsy sample, or in vitro , as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present 
invention may also be used in conjunction with in vitro model systems and preclinical evaluation of 

25 pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and 
toxicity (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson 
(2000) Toxicol. Lett. 1 12-1 13:467-471, expressly incorporated by reference herein). If a test 

30 compound has a signature similar to that of a compound with known toxicity, it is likely to share 
those toxic properties. These fingerprints or signatures are most useful and refined when they 
contain expression information from a large number of genes and gene families. Ideally, a genome- 
wide measurement of expression provides the highest quality signature. Even genes whose 
expression is not altered by any tested compounds are important as well, as the levels of expression 

35 of these genes are used to normalize the rest of the expression data. The normalization procedure is 
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useful for comparison of expression data after treatment with different compounds. While the 
assignment of gene function to elements of a toxicant signature aids in interpretation of toxicity 
mechanisms, knowledge of gene function is not necessary for the statistical matching of signatures 
which leads to prediction of toxicity. (See, for example, Press Release 00-02 from the National 

5 Institute of Environmental Health Sciences, released February 29, 2000, available at 

http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in 
toxicological screening using toxicant signatures to include all expressed gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 

10 treated biological sample are hybridized with one or more probes specific to the polynucleotides of 
the present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared * 
with levels in an untreated biological sample. Differences in the transcript levels between the two 
samples are indicative of a toxic response caused by the test compound in the treated sample. 

15 Another particular embodiment relates to the use of the polypeptide sequences of the 

present invention to analyze the proteome of a tissue or cell type. The term proteome refers to the 
global pattern of protein expression in a particular tissue or cell type. Each protein component of a 
proteome can be subjected individually to further analysis. Proteome expression patterns, or 
profiles, are analyzed by quantifying the number of expressed proteins and their relative abundance 

20 under given conditions and at a given time. A profile of a cell's proteome may thus be generated by 
separating and analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the 
separation is achieved using two-dimensional gel electrophoresis, in which proteins from a sample 
are separated by isoelectric focusing in the first dimension, and then according to molecular weight 
by sodium dodecyl sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, 

25 supra) . The proteins are visualized in the gel as discrete and uniquely positioned spots, typically by 
staining the gel with an agent such as Coomassie Blue or silver or fluorescent stains. The optical 
density of each protein spot is generally proportional to the level of the protein in the sample. The 
optical densities of equivalently positioned protein spots from different samples, for example, from 
biological samples either treated or untreated with a test compound or therapeutic agent, are 

30 compared to identify any changes in protein spot density related to the treatment. The proteins in 
the spots are partially sequenced using, for example, standard methods employing chemical or 
enzymatic cleavage followed by mass spectrometry. The identity of the protein in a spot may be 
determined by comparing its partial sequence, preferably of at least 5 contiguous amino acid 
residues, to the polypeptide sequences of the present invention. In some cases, further sequence 

35 data may be obtained for definitive protein identification. 
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A proteomic profile may also be generated using antibodies specific for TRICH to quantify 
the levels of TRICH expression! In one embodiment, the antibodies are used as elements on a 
microarray, and protein expression levels are quantified by exposing the microarray to the sample 
and detecting the levels of protein bound to each airay element (Lueking, A. et al. (1999) Anal. 
5 Biochem. 270: 103-1 11; Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be 
performed by a variety of methods known in the art, for example, by reacting the proteins in the 
sample with a thiol- or amino-reactive fluorescent compound and detecting the amount of 
fluorescence bound at each array element. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and 

10 should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor 

correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be' 
useful in the analysis of compounds which do not significantly affect the transcript image, but 
which alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, 

15 due to rapid degradation of mRNA, so proteomic profiling may be more reliable and informative in 
such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated 
biological sample are separated so that the amount of each protein can be quantified. The amount of 

20 each protein is compared to the amount of the corresponding protein in an untreated biological 
sample. A difference in the amount of protein between the two samples is indicative of a toxic 
response to the test compound in the treated sample. Individual proteins are identified by 
sequencing the amino acid residues of the individual proteins and comparing these partial sequences 
to the polypeptides of the present invention. 

25 In another embodiment, the toxicity of a test compound is assessed by treating a biological 

sample containing proteins with the test compound. Proteins from the biological sample are 
incubated with antibodies specific to the polypeptides of the present invention. The amount of 
protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
sample is compared with the amount in an untreated biological sample. A difference in the amount 

30 of protein between the two samples is indicative of a toxic response to the test compound in the 
treated sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, 
e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. 
Acad. Sci. USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116; 
35 Shalon, D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. 
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Sci. USA 94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of 
microarrays are well known and thoroughly described in DNA Microarravs: A Practical App roach. 
M. Schena, ed. (1999) Oxford University Press, London, hereby expressly, incorporated by 
reference. 

5 In another embodiment of the invention, nucleic acid sequences encoding TRICH may be 

used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences 
may be preferable over coding sequences. For example, conservation of a coding sequence among 
members of a multi-gene family may potentially cause undesired cross hybridization during 

10 chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial 
chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes ' 
(BACs), bacterial PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, : 
JJ. et al. (1997) Nat. Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, BJ. 

15 (1991) Trends Genet. 7:149-154.) Once mapped, the nucleic acid sequences of the invention may 
be used to develop genetic linkage maps, for example, which correlate the inheritance of a disease 
state with the inheritance of a particular chromosome region or restriction fragment length 
polymorphism (RFLP). (See, for example, Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. 
Sci. USA 83:7353-7357.) 

20 Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 

map data. (See, e.g.,*Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of 
genetic map data can be found in various scientific journals or at the Online Mendelian Inheritance 
in Man (OMIM) World Wide Web site. Correlation between the location of the gene encoding 
TRICH on a physical map and a specific disorder, or a predisposition to a specific disorder, may 

25 help define the region of DNA associated with that disorder and thus may further positional cloning 
efforts. 

Insitu hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 

30 may reveal associated markers even if the exact chromosomal locus is not known. This information 
is valuable to investigators searching for disease genes using positional cloning or other gene 
discovery techniques. Once the gene or genes responsible for a disease or syndrome have been 
crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 
1 lq22-23, any sequences mapping to that area may represent associated or regulatory genes for 

35 further investigation. (See, e.g., Gatti, R.A. et al. (19S8) Nature 336:577-580.) The nucleotide 

* 
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sequence of the instant invention may also be used to detect differences in the chromosomal 
location due to translocation, inversion, etc., among normal, earner, or affected individuals. 

In another embodiment of the invention, TRICH, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
5 screening techniques. The fragment employed in such screening may be free in solution, affixed to 
a solid support, borne on a cell surface, or located intracellularly . The formation of binding 
complexes between TRICH and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et aL (1984) PCT 

10 application WOS4/U3564.) to this method, large numbers of different small test compounds are 
synthesized on a solid substrate. The test compounds are reacted with TRICH, or fragments thereof, 
and washed. Bound TRICH is then detected by methods well known in the art. Purified TRICH can 
also be coated directly onto plates for use in the aforementioned drug screening techniques 
Alternatively, non neutralizing antibodies can be used to capture the peptide and immobilize it on a 

15 solid support. 

In another embodiment, one may use competitive drug screening assays in which 

neutralizing antibodies capable of binding TRICH specifically compete with a test compound for 

binding TRICH. In this manner, antibodies can be used to detect the presence of any peptide which 

shares one or more antigenic determinants with TRICH. 
20 In additional embodiments, the nucleotide sequences which encode TRICH may be used in 

any molecular biology techniques that have yet to be developed, provided the new techniques rely 

on properties of nucleotide sequences that are currently known, including, but not limited to, such 

properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
25 description, utilize the present invention to its fullest extent. The following embodiments are, 

therefore, to be consumed as merely illustrative, and not limitative of the remainder of the disclosure 

in any way whatsoever. 

The disclosures of all patents, applications and publications, mentioned above and below, 

including U.S. Ser. No. 60/254,303, U.S. Ser. No. 60/256,190, U.S. Ser. No. 60/257,504, U.S. Ser. 
30 No. 60/261,546, U.S. Ser. No. 60/262,832, U.S. Ser! No. 60/264,377, and U.S. Ser. No. 60/266,019, 

are expressly incorporated by reference herein. 

EXAMPLES 

I. Construction of cDNA Libraries 
35 Incyte cDNAs were derived from cDNA libraries described in the LIFESEQ GOLD 
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database (Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in 
guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a suitable 
mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of phenol and 
guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted 
5 with chloroform. RNA was precipitated from the lysates with either isopropanol or isodium acetate 
and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was 
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles 

10 (QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, 
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the 
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding 
cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 

15 UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using 
the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , 
units 5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with, 
the appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected 

20 (300-1000 bp) using SEPHACRYL S 1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. 
cDNAs were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, 
e.g., PBLUESCRDPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), PCDNA2.1 
plasmid (Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), PCR2-TOPOTA plasmid 

25 (Invitrogen), PCMV-ICIS plasmid (Stratagene), pIGEN (Incyte Genomics, Palo Alto CA), pRARE 
(Incyte Genomics), or pINCY (Incyte Genomics), or derivatives thereof. Recombinant plasmids 
were transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or SOLR from 
Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. 
EL Isolation of cDNA Clones 

30 Plasmids obtained as described in Example I were recovered from host cells by in vivo 

excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified 
using at least one of the following: a Magic or WIZARD Minipreps DNA purification system 
(Promega); an AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and 
QIAWELL 8 Plasmid, QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems 

35 or the R.E.A.L. PREP 96 plasmid purification kit from QIAGEN. Following precipitation, plasmids 
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were resuspended in 0.1 ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
5 384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 
III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 

10 Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal 
cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the ✓ 
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such 

15 as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the 
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art. Reading 

20 frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 
1997, supra , unit 7.7). Some of the cDNA sequences were selected for extension using the 
techniques disclosed in Example VUL 

The polynucleotide sequences derived from Incyte cDNAs were validated by removing 
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 

25 programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. 
The Incyte cDNA sequences or translations thereof were then queried against a selection of public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, 
and BLOCKS, PRINTS, DOMO, PRODOM; PROTEOME databases with sequences from Homo 
sapiens , Rattus norvegicus , Mus musculus » Caenorhabditis eleeans , Saccharomvces cerevisiae , 

30 Schizosaccharomvces pombe . and Candida albicans (Incyte Genomics, Palo Alto CA); and hidden 
Markov model (HMM)-based protein family databases such as PFAM. (HMM is a probabilistic 
approach which analyzes consensus primary structures of gene families. See, for example, Eddy, 
S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) The queries were performed using programs 
based on BLAST, FASTA, BLIMPS, and HMMER. The Incyte cDNA sequences were assembled 

35 to produce full length polynucleotide sequences. Alternatively, GenBank cDNAs, GenBank ESTs, 
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stitched sequences, stretched sequences, or Genscan-predicted coding sequences (see Examples IV 
and V) were used to extend Incyte cDNA assemblages to full length. Assembly was performed 
using programs based on Phred, Phrap, and Consed, and cDNA assemblages were screened for open 
reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 
5 polynucleotide sequences were translated to derive the corresponding full length polypeptide 
sequences. Alternatively, a polypeptide of the invention may begin at any of the methionine 
residues of the full length translated polypeptide. Full length polypeptide sequences were 
subsequently analyzed by querying against databases such as the GenBank protein databases 
(genpept), SwissProt, the PROTEOME databases, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, 

10 and hidden Markov model (HMM)-based protein family databases such as PFAM. Full length 
polynucleotide sequences are also analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide' 
and polypeptide sequence alignments are generated using default parameters specified by the 
CLUSTAL algorithm as incorporated into the MEGALIGN multisequence alignment program 

15 (DNASTAR), which also calculates the percent identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly 
of Incyte cDNA and full length sequences and provides applicable descriptions, references, and 
threshold parameters. The first column of Table 7 shows the tools, programs, and algorithms used, 
the second column provides brief descriptions thereof, the third column presents appropriate 

20 references, all of which are incorporated by reference herein in their entirety, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate the 
strength of a match between two sequences (the higher the score or the lower the probability value, 
the greater the identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide 

25 and polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ 
ID NO:33-64. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization 
and amplification technologies are described in Table 4, column 2. 
IV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative transporters and ion channels were initially identified by running the Genscan gene 

30 identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan 
is a general-purpose gene identification program which analyzes genomic DNA sequences from a 
variety of organisms (See Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94, and Burge, C. and 
S. Karlin (1998) Cunr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to 
form an assembled cDNA sequence extending from a methionine to a stop codon. The output of 

35 Genscan is a FASTA database of polynucleotide and polypeptide sequences. The maximum range 
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of sequence for Genscan to analyze at once was set to 30 kb. To determine which of these Genscan 
predicted cDNA sequences encode transporters and ion channels, the encoded polypeptides were 
analyzed by querying against PFAM models for transporters and ion channels. Potential 
transporters and ion channels were also identified by homology to Incyte cDNA sequences that had 
5 been annotated as transporters and ion channels. These selected Genscan-predicted sequences were 
then compared by BLAST analysis to the genpept and gbpri public databases. Where necessary, the 
Genscan-predicted sequences were then edited by comparison to the top BLAST hit from genpept to 
correct errors in the sequence predicted by Genscan, such as extra or omitted exons. BLAST ' 
analysis was also used to find any Incyte cDNA or public cDNA coverage of the Genscan-predicted 

10 sequences, thus providing evidence for transcription. When Incyte cDNA coverage was available, 
this information was used to correct or confirm the Genscan predicted sequence. Full length 
polynucleotide sequences were obtained by assembling Genscan-predicted coding sequences with ' 
Incyte cDNA sequences and/or public cDNA sequences using the assembly process described in 
Example EL Alternatively, full length polynucleotide sequences were derived entirely from edited 

15 or unedited Genscan-predicted coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Sequences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene 
identification program described in Example IV. Partial cDNAs assembled as described in Example 

20 m were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an 
algorithm based on graph theory and dynamic programming to integrate cDNA and genomic 
information, generating possible splice variants that were subsequently confirmed, edited, or 
extended to create a full length sequence. Sequence intervals in which the entire length of the 

25 interval was present on more than one sequence in the cluster were identified, and intervals thus 
identified were considered to be equivalent by transitivity. For example, if an interval was present 
on a cDNA and two genomic sequences, then all three intervals were considered to be equivalent. 
This process allows unrelated but consecutive genomic sequences to be brought together, bridged by 
cDNA sequence. Intervals thus identified were then "stitched" together by the stitching algorithm 

30 in the order that they appear along their parent sequences to generate the longest possible sequence, 
as well as sequence variants. Linkages between intervals which proceed along one type of parent 
sequence (cDNA to cDNA or genomic sequence to genomic sequence) were given preference over 
linkages which change parent type (cDNA to genomic sequence). The resultant stitched sequences 
were translated and compared by BLAST analysis to the genpept and gbpri public databases. 

35 Incorrect exons predicted by Genscan were corrected by comparison to the top BLAST hit from 
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genpept. Sequences were further extended with additional cDNA sequences, or by inspection of 
genomic DNA, when necessary. 
"Stretched" Sequences 

Partial DNA sequences were extended to full length with an algorithm based on BLAST 
5 analysis. First, partial cDNAs assembled as described in Example in were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST 
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs 

10 (HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions 
may occur in the chimeric protein with respect to the original GenBank protein homolog. The 
GenBank protein homolog, the chimeric protein, or both were used as probes to search for ' 
homologous genomic sequences from the public human genome databases. Partial DNA sequences 
were therefore "stretched" or extended by the addition of homologous genomic sequences. The 

15 resultant stretched sequences were examined to determine whether it contained a complete gene. 
VI. Chromosomal Mapping of TRICH Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:33-64 were compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 

20 SEQ ID NO:33-64 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the 

25 assignment of all sequences of that cluster, including its particular SEQ ID NO:, to that map 
location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's 
p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies 

30 between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of 
DNA in humans, although this can vary widely due to hot and cold spots of recombination.) The 
cM distances are based on genetic markers mapped by Gen6thon which provide boundaries for 
radiation hybrid markers whose sequences were included in each of the clusters. Human genome 
maps and other resources available to the public, such as the NCBI "GeneMap*99" World Wide 

35 Web site (http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously 
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identified disease genes map within or in proximity to the intervals indicated above. 
VIL Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which 
5 RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; 
Ausubel (1995) supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or 
related molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This 
analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of 
10 the computer search can be modified to determine whether any particular match is categorized as 
exact or similar. The basis of the search is the product score, which is defined as: 

. • ■ . s 

BLAST Score x Percent Identity 
5 x minimum {length(Seq. 1), length(Seq. 2)} 

15 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 

20 calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to 
calculate the product score. The product score represents a balance between fractional overlap and 
quality in a BLAST alignment For example, a product score of 100 is produced only for 100% 

25 identity over the entire length of the shorter of the two sequences being compared. A product score 
of 70 is produced either by 100% identity and 70% overlap at one end, or by 88% identity and 100% 
overlap at the other. A product score of 50 is produced either by 100% identity and 50% overlap at 
one end, or 79% identity and 100% overlap. 

Alternatively, polynucleotide sequences encoding TRICH are analyzed with respect to the 

30 tissue sources from which they were derived. For example, some full length sequences are 

assembled, at least in part, with overlapping Incyte cDNA sequences (see Example HI). Each cDNA 
sequence is derived from a cDNA library constructed from a human tissue. Each human tissue is 
classified into one of the following organ/tissue categories: cardiovascular system; connective 
tissue; digestive system; embryonic structures; endocrine system; exocrine glands; genitalia, female; 

35 genitalia, male; germ cells; hemic and immune system; liver, musculoskeletal system; nervous 
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system; pancreas; respiratory system; sense organs; skin; stomatognathic system; 
unclassified/mixed; or urinary tract. The number of libraries in each category is counted and 
divided by the total number of libraries across all categories. Similarly, each human tissue is 
classified into one of the following disease/condition categories: cancer, cell line, developmental, 
5 inflammation, neurological, trauma, cardiovascular, pooled, and other, and the number of libraries 
in each category is counted and divided by the total number of libraries across ail categories. The 
resulting percentages reflect the tissue- and disease-specific expression of cDNA encoding TRICH. 
cDNA sequences and cDNA library/tissue information are found in the LIPESEQ GOLD database 
(Incyte Genomics, Palo Alto CA). 

10 VIII. Extension of TRICH Encoding Polynucleotides 

Full length polynucleotide sequences were also produced by extension of an appropriate 
fragment of the full length molecule using oligonucleotide primers designed from this fragment. 
One primer was synthesized to initiate 5' extension of the known fragment, and the other primer was 
synthesized to initiate 3' extension of the known fragment. The initial primers were designed using 

15 OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 

20 extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The 
reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2+ , 
(NH 4 ) 2 S0 4 , and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), 

25 ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the 

following parameters for primer pair PCI A and Pa B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; 
Step 3: 60°C, 1 min; Step 4: 68 °C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 °C, 
5 min; Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+ were as 
follows: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 

30 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C 

The concentration of DNA in each well was determined by dispensing 100 fil PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 fil of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent The plate was scanned in a Fluoroskan II 

35 (Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
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concentration of DNA. A 5 /zl to 10 /zl aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1 % agarose gel to determine which reactions were successful in extending the 
sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384 r well plates, 
5 digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended 
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector 

10 (Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction 
site overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37 °C ur" 
384-well plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 

15 (Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 

parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; " 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72 °C, 5 min; Step 7: storage at 4°C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low 
DNA recoveries were reamplified using the same conditions as described above. Samples were 

20 diluted with 20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer 
sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 

In like manner, full length polynucleotide sequences are verified using the above procedure* 
or are used to obtain 5' regulatory sequences using the above procedure along with oligonucleotides 

25 designed for such extension, and an appropriate genomic library . 
IX. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:33-64 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 

30 fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 /iCi of 
[y- 32 P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 

35 An aliquot containing 10 7 counts per minute of the labeled probe is used in a typical membrane- 
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based hybridization analysis of human genomic DNA digested with one of the following 
endonucleases: Ase I, Bgl n, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
5 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

X. Microarrays 

10 The linkage or synthesis of array elements upon a microarray can be achieved utilizing 

photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra ,). . 
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the . ^ 
aforementioned technologies should be uniform and solid with a non-porous surface (Schena 
(1999), supra) . Suggested substrates include silicon, silica, glass slides, glass chips, and silicon 

15 wafers. Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and 
link elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 
procedures. A typical array may be produced using available methods and machines well known to 
those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g., 
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 

20 Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof 
may comprise the elements of the microarray. Fragments or oligomers suitable for hybridization 
can be selected using software well known in the art such as LASERGENE software (DNASTAR). 
The array elements are hybridized with polynucleotides in a biological sample. The polynucleotides 

25 in the biological sample are conjugated to a fluorescent label or other molecular tag for ease of 
detection. After hybridization, nonhybridized nucleotides from the biological sample are removed, 
and a fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element 

30 on the microarray may be assessed. In one embodiment, microarray preparation and usage is 
described in detail below. 
Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 

35 reverse transcribed using MMLV reverse-transcriptase, 0.05 pg//il oligo-(dT) primer (21mer), IX 
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first strand buffer, 0.03 units//d RNase inhibitor, 500 /xM dATP, 500 fiM dGTP, 500 fiM dTTP, 40 
fiM dCTP, 40 /tM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly (A) + RNA with 
GEMBRIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro 
5 transcription from non-coding yeast genomic DNA. After incubation at 37 °C for 2 hr, each reaction 
sample (one with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium 
hydroxide and incubated for 20 minutes at 85 °C to the stop the reaction and degrade the RNA. 
Samples are purified using two successive CHROMA SPIN 30 gel filtration spin columns 
(CLONTECH Laboratories, Inc. (CLONTECH), Palo Alto CA) and after combining, both reaction 

10 samples are ethanol precipitated using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 
ml of 100% ethanol. The sample is then dried to completion using a SpeedVAC (Savant 
Instruments Inc., Holbrook NY) and resuspended in 14 fil 5X SSC/0.2% SDS. y 
Microarrav Preparation 

Sequences of the present invention are used to generate array elements. Each array element 

15 is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification 
uses primers complementary to the vector sequences flanking the cDNA insert Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
/ig. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 

20 Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, 
and coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 

25 110°Coven. 

Array elements are applied to the coated glass substrate using a procedure described in U.S. 
Patent No. 5,807,522, incorporated herein by reference. 1 p\ of the array element DNA, at an 
average concentration of 100 ng//xl, is loaded into the open capillary printing element by a high- 
speed robotic apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

30 Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60° C followed by washes in 
0.2% SDS and distilled water as before. 

35 Hybridization 

83 



•JSDOCID: <WO 024641 5A2_IA> 



WO 02/46415 



PCT/US01/46963 



Hybridization reactions contain 9 fil of sample mixture consisting of 0.2 fig each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65° C for 5 minutes and is aliquoted onto the microarray surface and covered 
with an 1.8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just 
5 slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the 
addition of 140 jxl of 5X SSC in a corner of the chamber. The chamber containing the arrays is 
incubated for about 6.5 hours at 60° C. The arrays are washed for 10 min at 45° C in a first wash 
buffer (IX SSC, 0.1% SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X 
SSC), and dried. 
10 Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral ' 
lines at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light 
is focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 

15 containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with 
a resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 

20 Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. 

Appropriate filters positioned between the array and the photomultiplier tubes are used to filter the 
signals. The emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. 
Each array is typically scanned twice, one scan per fluorophore using the appropriate filters at the 
laser source, although the apparatus is capable of recording the spectra from both fluorophores 

25 simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location 
on the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two 

30 samples from different sources (e.g., representing test and control cells), each labeled with a 

different fluorophore, are hybridized to a single array for the purpose of identifying genes that are 
differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with 
the two fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to- 

35 digital (A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM- 
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compatible PC computer. The digitized data are displayed as an image where the signal intensity is 
mapped using a linear 20-color transformation to a pseudocolor scale ranging from blue (low 
signal) to red (high signal). The data is also analyzed quantitatively. Where two different 
fluorophores are excited and measured simultaneously, the data are first corrected for optical 
5 crosstalk (due to overlapping emission spectra) between the fluorophores using each fluorophore's 
emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each 
spot is centered in each element of the grid. The fluorescence signal within each element is then 
integrated to obtain a numerical value corresponding to the average intensity of the signal. The 

10 software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
XI. Complemeiitar) Polynucleotides 

Sequences complementary to the TRICH-encoding sequences, or any parts thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring TRICH. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 

15 procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of TRICH. 
To inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' 
sequence and used to prevent promoter binding to the coding sequence. To inhibit translation, a 
complementary oligonucleotide is designed to prevent ribosomal binding to the TRICH-encoding 

20 transcript. 

XIL Expression of TRICH 

Expression and purification of TRICH is achieved using bacterial or virus-based expression 
systems. For expression of TRICH in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 

25 transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express TRICH upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of TRICH in eukaryotic cells is achieved by infecting 

30 insect or mammalian cell lines with recombinant Autographica califomica nuclear polyhedrosis 
virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of 
baculovirus is replaced with cDNA encoding TRICH by either homologous recombination or 
bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is 
maintained and the strong polyhedrin promoter drives high levels of cDNA transcription. 

35 Recombinant baculovirus is used to infect Spodoptera frugjperda (Sf9) insect cells in most cases, or 
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human hepatocytes, in some cases. Infection of the latter requires additional genetic modifications 
to baculovirus. (See Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; 
Sandig, V. et al. (1996) Hum. Gene Then 7:1937-1945.) 

In most expression systems, TRICH is synthesized as a fusion protein with, e.g., glutathione 
5 S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26- 
kilodalton enzyme from Schistosoma japonienm, enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from 
10 TRICH at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity 
purification using commercially available monoclonal and polyclonal anti-FLAG antibodies 
(Eastman Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on ' 
metal-chelate resins (QIAGEN). Methods for protein expression and purification are discussed in 
Ausubel (1995, supra, ch. 10 and 16). Purified TRICH obtained by these methods can be used 

15 directly in the assays shown in Examples XVI, XVII, and XVm where applicable. 
XIII. Functional Assays 

TRICH function is assessed by expressing the sequences encoding TRICH at 
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a 
mammalian expression vector containing a strong promoter that drives high levels of cDNA 

20 expression. Vectors of choice include PCMV SPORT (Life Technologies) and PCR3. 1 (Invitrogen, 
Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 /ig of recombinant vector 
are transiently transfected into a human cell line, for example, an endothelial or hematopoietic cell 
line, using either liposome formulations, or electroporation. 1-2 /^g of an additional plasmid 
containing sequences encoding a marker protein are co-transfected. Expression of a marker protein 

25 provides a means to distinguish transfected cells from nontransfected cells and is a reliable predictor 
of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., Green 
Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry 
(FCM), an automated, laser optics-based technique, is used to identify transfected cells expressing 
GFP or CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties. 

30 FCM detects and quantifies the uptake of fluorescent molecules that diagnose events preceding or 
coincident with cell death. These events include changes in nuclear DNA content as measured by 
staining of DNA with propidium iodide; changes in cell size and granularity as measured by forward 
light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by 
decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular 

35 proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane 
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composition as measured by the binding of fluorescein-conjugated Annexin V protein to the cell 
surface. Methods in flow cytometry are discussed in Ormerod, M.G. (1994) Flow Cytometry . 
Oxford, New York NY. 

The influence of TRICH on gene expression can be assessed using highly purified 
5 populations of cells transfected with sequences encoding TRICH and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved 
regions of human immunoglobulin G (IgG). Transfected cells are efficiently separated from 
nontransfected cells using magnetic beads coated with either human IgG or antibody against CD64 
(DYNAL, Lake Success NY). mRNA can be purified from the cells using methods well known by 
10 those of skill in the art. Expression of mRNA encoding TRICH and other genes of interest can be 
analyzed by northern analysis or microarray techniques. 

XTV. Production of TRICH Specific Antibodies ' 

TRICH substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used 

15 to immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the TRICH amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are 

20 well described in the art (See, e.g., Ausubel, 1995, supra , ch. 11.) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 
Atdrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) 
to increase immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the 

25 oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for 

antipeptide and anti-TRICH activity by, for example, binding the peptide or TRICH to a substrate, 
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated 
goat anti-rabbit IgG. 

XV. Purification of Naturally Occurring TRICH Using Specific Antibodies 
30 Naturally occurring or recombinant TRICH is substantially purified by immunoaffinity 

chromatography using antibodies specific for TRICH: An immunoaffinity column is constructed by 
covalently coupling anti-TRICH antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 
35 Media containing TRICH are passed over the immunoaffinity column, and the column is 

87 



:.OCID: <WO___0246415A2JA> 



WO 02/46415 



PCT/US0 1/46963 



washed under conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/TRICH binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, 
such as urea or thiocyanate ion), and TRICH is collected. 
5 XVI. Identification of Molecules Which Interact with TRICH 

Molecules which interact with TRICH may include transporter substrates, agonists or 
antagonists, modulatory proteins such as Gffy proteins (Reimann, supra) or proteins involved in 
TRICH locaUzation or clustering such as MAGUKs (Craven, supraV TRICH, or biologically active 
fragments thereof, are labeled with 125 I Bolton-Hunter reagent. (See, e.g., Bolton A.E. and W.M. 
10 Hunter (1973) Biochem. J. 1 33:529-539.) Candidate molecules previously arrayed in the wells of a 
multi-well plate arc incubated with the labeled TRICH, washed, and any wells with labeled TRICH 
complex are assayed. Data obtained using different concentrations of TRICH are used to calculate ' 
values for the number, affinity, and association of TRICH with the candidate molecules. 

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system 
15 (Fields, S. and O. Soog ( 19S9) Nature 340:245-246). TRICH, or fragments thereof, are expressed as 
fusion proteins wiih the DNA binding domain of Gal4 or lexA, and potential interacting proteins are 
expressed as fusion proteins with an activation domain. Interactions between the TRICH fusion 
protein and the TRICH interacting proteins (fusion proteins with an activation domain) reconstitute 
a transactivation function that is observed by expression of a reporter gene. Yeast 2-hybrid systems 
20 axe commercially available, and methods for use of the yeast 2-hybrid system with ion channel 
proteins are discussed in Niethammer, M. and M. Sheng (1998, Meth. Enzymol. 293:104-122). 

TRICH may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all 
interactions between the proteins encoded by two large libraries of genes (Nandabalan, K. et aL 
25 (2000) U.S. Patent No. 6,057,101). 

Potential TRICH agonists or antagonists may be tested for activation or inhibition of 
TRICH ion channel activity using the assays described in section XVIII. 
XVII. Demonstration of TRICH Activity 

Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion 
30 conductance. TRICH can be expressed by transforming a mammalian cell line such as COS7, HeLa 
or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic expression vectors are 
commercially available, and the techniques to introduce them into cells are well known to those 
skilled in the ait. A second plasmid which expresses any one of a number of marker genes, such as 
8-galactosidase, is co-transformed into the cells to allow rapid identification of those cells which 
35 have taken up and expressed the foreign DNA. The cells are incubated for 4S-72 hours after 
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transformation under conditions appropriate for the cell line to allow expression and accumulation 
of TRICH and B-galactosidase. 

Transformed cells expressing B-galactosidase are stained blue when a suitable colorimetric 
substrate is added to the culture media under conditions that are well known in the art. Stained cells 
5 are tested for differences in membrane conductance by electrophysiological techniques that are well 
known in the art. Untransformed cells, and/or cells transformed with either vector sequences alone 
or B-galactosidase sequences alone, are used as controls and tested in parallel Cells expressing 
TRJCH will have higher anion or cation conductance relative to control cells. The contribution of 
TRICH to conductance can be confirmed by incubating the cells using antibodies specific for 

10 TRICH. The antibodies will bind to the extracellular side of TRICH, thereby blocking the pore in 
the ion channel, and the associated conductance. 

Alternatively, ion channel activity of TRICH is measured as current flow across a TRICH^ 
containing Xenopus laevis oocyte membrane using the two-electrode voltage-clamp technique (Ishi 
et al., supra : Jegla, T. and L. Salkoff (1997) J. Neurosci. 17:32-44). TRICH is subcloned into an 

15 appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is injected into 
mature stage IV oocytes. Injected oocytes are incubated at 18 °C for 1-5 days. Inside-out 
macropatches are excised into an intracellular solution containing 116 mM K-gluconate, 4 mM KC1, 
and 10 mM Hepes (pH 7.2). The intracellular solution is supplemented with varying concentrations 
of the TRICH mediator, such as cAMP, cGMP, or Ca +2 (in the form of CaCJ 2 ), where appropriate. 

20 Electrode resistance is set at 2-5 MQ and electrodes are filled with the intracellular solution lacking 
mediator. Experiments are performed at room temperature from a holding potential of 0 mV. 
Voltage ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz. 
Current measured is proportional to the activity of TRICH in the assay. In particular, the activity of 
TRICH-1 is measured as Na + conductance and the activity of TRICH-3 is measured as Ca 2+ 

25 conductance. 

Transport activity of TRICH is assayed by measuring uptake of labeled substrates into 
Xenopus laevis oocytes. Oocytes at stages V and VI are injected with TRICH mRNA (10 ng per 
oocyte) and incubated for 3 days at 18°C in OR2 medium (82.5mM NaCl, 2.5 mM KC1, ImM CaCl 
2 , ImM MgCl 2 , ImM Na 2 HP0 4 , 5 mM Hepes, 3.8 mM NaOH , 50/ig/ml gentamycin, pH 7.8) to 

30 allow expression of TRICH. Oocytes are then transferred to standard uptake medium (lOOmM 
NaCl, 2 mM KC1, ImM CaCl 2 ; ImM MgCl 2 , 10 mM Hepes/Tris pH 7.5). Uptake of various 
substrates (e.g., amino acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding 
labeled substrate (e.g. radiolabeled with 3 H, fluorescently labeled with rhodamine, etc.) to the 
oocytes. " After incubating for 30 minutes, uptake is terminated by washing the oocytes three times 

35 in NaMree medium, measuring the incorporated label, and comparing with controls. TRICH 
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activity is proportional to the level of internalized labeled substrate. Test substrates include ran- 
GTP for TRICH- 2, glucose for TRICH-5, amino acids for TRICH-6 and TRICH-14, cations for 
TRICH-7 and TRICH-16, Na + , K + and CI" ions for TRICH-15, reduced folate or analogues such as 
methotrexate for TRICH-17, divalent cations for TRICH-18, anions such as arsenate and antimonite 
5 for TRICH- 19, and nitrate or oligopeptides for TRICH-20. 

ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled 
ATP-[y- 32 P], separation of the hydrolysis products by chromatographic methods, and quantitation of 
the recovered 32 P using a scintillation counter. The reaction mixture contains ATP-[y- 32 P] and 
varying amounts of TRICH in a suitable buffer incubated at 37 °C for a suitable period of time. The 

10 reaction is terminated by acid precipitation with trichloroacetic acid and then neutralized with base, 
and an aliquot of the reaction mixture is subjected to membrane or filter paper-based 
chromatography to separate the reaction products. The amount of 32 P liberated is counted in a ■ ' 
scintillation counter. The amount of radioactivity recovered is proportional to the ATPase activity 
of TRICH in the assay. 

15 Lipocalin activity of TRICH is measured by ligand fluorescence enhancement 

spectrofluorometry (Lin et al. (1997) Molecular Vision 3:17). Examples of ligands include retinol 
(Sigma, St. Louis MO) and 16-anthryloxy-palmitic acid (16-AP) (Molecular Probes Inc., Eugene 
OR). Ligand is dissolved in 100% ethanol and its concentration is estimated using known 
extinction coefficents (retinol: 46,000 A/M/cm at 325 nm; 16-AP: 8,200 A/M/cm at 361 nm). A 

20 700 id aliquot of 1 fiM TRICH in 10 mM Tris (pH 7.5), 2 mM EDTA, and 500 mM NaCl is placed 
in a 1 cm path length quartz cuvette and 1 /xl aliquots of ligand solution are added. Fluorescence is 
measured 100 seconds after each addition until readings are stable. Change in fluorescence per unit 
change in ligand concentration is proportional to TRICH activity. 
XVIII. Identification of TRICH Agonists and Antagonists 

25 TRICH is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or 

HEK (Human Embryonic Kidney) 293. Ion channel activity of the transformed cells is measured in 
the presence and absence of candidate agonists or antagonists. Ion channel activity is assayed using 
patch clamp methods well known in the art or as described in Example XVII. Alternatively, ion 
channel activity is assayed using fluorescent techniques that measure ion flux across the cell 

30 membrane (Velicelebi, G. et al. (1999) Meth. Enzymol. 294:20-47; West, M.R. and C.R. Molloy 
(1996) Anal. Biochem. 241:51-58). These assays may be adapted for high-throughput screening 
using microplates. Changes in internal ion concentration are measured using fluorescent dyes such 
as the Ca 2+ indicator Fluo-4 AM, sodium-sensitive dyes such as SBFI and sodium green, or the CI* 
indicator MQAE (all available from Molecular Probes) in combination with the FLIPR fluorimetric 

35 plate reading system (Molecular Devices). In a more generic version of this assay, changes in 
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membrane potential caused by ionic flux across the plasma membrane are measured using oxonyl 
dyes such as DiBAQ (Molecular Probes). DiBAC 4 equilibrates between the extracellular solution 
and cellular sites according to the cellular membrane potential. The dye's fluorescence intensity is 
20-fold greater when bound to hydrophobic intracellular sites, allowing detection of DiBAQ entry 
5 into the cell (Gonzalez, J.E. and P.A. Negulescu (1998) Curr. Opin. Biotechnol. 9:624-631). 

Candidate agonists or antagonists may be selected from known ion channel agonists or antagonists, 
peptide libraries, or combinatorial chemical libraries. 

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the 
10 invention. Although the invention has been described in connection with certain embodiments, it 
should be understood that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out the invention^ 
which are obvious to those skilled in molecular biology or related fields are intended to be within 
the scope of the following claims. 
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What is claimed is: 

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32, 

c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32, and 

d) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32. ' 

2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-32. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:33-64. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method of producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein 
said cell is transformed with a recombinant polynucleotide, and said recombinant 
polynucleotide comprises a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide of claim 1, and 
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10. A method of claim 9, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32. 

5 

1 1. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group 
10 consisting of SEQ ID NO:33-64, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90* identical lo a polynucleotide sequence selected from the group consisting of y 
SEQ ID N0 33-64. 

c) a polynucleotide complementary to a polynucleotide of a), 

15 d) a polynucleotide complementary to a polynucleotide of b), and 

e) an RNA equivalent of a)-d). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

20 

14. A method of delecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, 

25 and which probe specifically hybridizes to said target polynucleotide, under 

conditions whereby a hybridization complex is formed between said probe and said 
target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 
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15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 



16. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 
35 a) amplifying said target polynucleotide or fragment thereof using polymerase chain 

133 



NSDOCID: <WO 024641 5A2JA> 



WO 02/46415 



PCT/US01/46963 



reaction amplification, and 
b) detecting the presence or absence of said amplified target polynucleotide or 
fragment thereof, and, optionally, if present, the amount thereof. 

17. A composition comprising a polypeptide of claim 1 and a pharmaceutically acceptable 
excipient. 

18. A composition of claim 17, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32. 

19. A method for treating a disease or condition associated with decreased expression of 
functional TRICH, comprising administering to a patient in need of such treatment the 
composition of claim 17. 

20. Amethod of screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

21. A composition comprising an agonist compound identified by a method of claim 20 and 
a pharmaceutically acceptable excipient. 

22. A method for treating a disease or condition associated with decreased expression of 
functional TRICH, comprising administering to a patient in need of such treatment a 

25 composition of claim 2 1 . 

23. A method of screening a compound for effectiveness as an antagonist of a polypeptide 
of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 
30 b) detecting antagonist activity in the sample. 

24. A composition comprising an antagonist compound identified by a method of claim 23 
and a pharmaceutically acceptable excipient. 

35 25. A method for treating a disease or condition associated with overexpression of 
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functional TRICH, comprising administering to a patient in need of such treatment a 
composition of claim 24. 

26. A method of screening for a compound that specifically binds to the polypeptide of 
claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under 
suitable conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby 
identifying a compound that specifically binds to the polypeptide of claim 1. 

27. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under 
conditions permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test 
compound, and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test 
compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 
presence of the test compound is indicative of a compound that modulates the 
activity of the polypeptide of claim 1. 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the 
method comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under 
conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying 
amounts of the compound and in the absence of the compound. 

29. A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe 
comprising at least 20 contiguous nucleotides of a polynucleotide of claim 12 under 
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conditions whereby a specific hybridization complex is formed between said probe 
and a target polynucleotide in the biological sample, said target polynucleotide 
comprising a polynucleotide sequence of a polynucleotide of claim 12 or fragment 
thereof, 

c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample 
with the amount of hybridization complex in an untreated biological sample, 
wherein a difference in the amount of hybridization complex in the treated 
biological sample is indicative of toxicity of the test compound. 

30. A diagnostic test for a condition or disease associated with the expression of TRICH in 
a biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 11, under conditions 
suitable for the antibody to bind the polypeptide and form an antibody-.polypeptide 
complex, and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 

31. The antibody of claim 11, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a FCab'), fragment, or 

e) a humanized antibody. 

32. A composition comprising an antibody of claim 11 and an acceptable excipient. 



33. A method of diagnosing a condition or disease associated with the expression of 
TRICH in a subject, comprising administering to said subject an effective amount of the 

composition of claim 32. 



34. A composition of claim 32, wherein the antibody is labeled. 



35. 

TRICH 



A method of diagnosing a condition or disease associated with the expression of 
in a subject, comprising administering to said subject an effective amount of the 

136 



WO 02/46415 



PCT/US01/46963 



composition of claim 34. 



36. A method of preparing a polyclonal antibody with the specificity of the antibody of 
claim 1 1 , the method comprising: 

5 a) immunizing an animal with a polypeptide consisting of an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-32, or an immunogenic 
fragment thereof, under conditions to elicit an antibody response, 

b) isolating antibodies from said animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a 

10 polyclonal antibody which binds specifically to a polypeptide comprising an amino 

acid sequence selected from the group consisting of SEQ ID NO: 1-32. , 

37. A polyclonal antibody produced by a 'method of claim 36. 

15 38. A composition comprising the polyclonal antibody of claim 37 and a suitable carrier. 

39. A method of making a monoclonal antibody with the specificity of the antibody of 
claim 1 1, the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
20 selected from the group consisting of SEQ ID NO: 1-32, or an immunogenic 

fragment thereof, under conditions to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
antibody-producing hybridoma cells, 

25 d) culturing the hybridoma cells, and 

e) isolating from the culture monoclonal antibody which binds specifically to a 

polypeptide comprising an amino acid sequence selected from the group consisting 
ofSEQIDNO:l~32. 

30 40. A monoclonal antibody produced by a method of claim 39. 

41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 

42. The antibody of claim 11, wherein the antibody is produced by screening a Fab 
35 expression library. 
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43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

44. A method of detecting a polypeptide comprising an amino acid sequence selected from 
5 the group consisting of SEQ ID NO: 1-32 in a sample, the method comprising: 

a) incubating the antibody of claim 1 1 with a sample under conditions to allow 
specific binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide comprising an amino acid sequence selected from the group consisting 

10 of SEQ ID NO: 1-32 in the sample. 

45. A method of purifying a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-32 from a sample, the method comprising: 

a) incubating the antibody of claim 1 1 with a sample under conditions to allow 
15 specific binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32. 

20 46. A microarray wherein at least one element of the microarray is a polynucleotide of 

claim 13. 

47. A method of generating an expression profile of a sample which contains 
polynucleotides, the method comprising: 

25 a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microarray of claim 46 with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a 
hybridization complex, and 

c) quantifying the expression of the polynucleotides in the sample. 

30 

48. An array comprising different nucleotide molecules affixed in distinct physical 
locations on a solid substrate, wherein at least one of said nucleotide molecules comprises a 
first oligonucleotide or polynucleotide sequence specifically hybridizable with at least 30 

contiguous nucleotides of a target polynucleotide, and wherein said target polynucleotide is 
35 a polynucleotide of claim 12. 
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49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target 
polynucleotide. 

5 50. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 

completely complementary to at least 60 contiguous nucleotides of said target 
polynucleotide. 



51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
10 completely complementary to said target polynucleotide. 

52. An array of claim 48, which is a microarray. 

53. An array of claim 48, further comprising said target polynucleotide hybridized to a 
15 nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 



55. An array of claim.48, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct 
physical location have the same sequence, and each distinct physical location on the 

substrate contains nucleotide molecules having a sequence which differs from the sequence 
of nucleotide molecules at another distinct physical location on the substrate. 

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:l. 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 
30 58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

35 
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61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:8. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 10. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 1 1. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 12. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 13. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 14. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:15. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 16. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 17. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: IS. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 19. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

78. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 
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79. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 
SO. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

81 . A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

82. A polypeptide of claim 1, comprising the amino acid sequence of SEQ JD NO:27. 

83. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

84. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 

85. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30. 
15 86. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:31. 

87. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:32. 

S8. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
20 NO:33. 

89. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:34. 

25 90. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:35. 

91. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:36. 

30 

92. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:37. 

93. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
35 NO:38. 
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94. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:39. 

95. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
5 NO:40. 

96. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:41. 

10 97. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:42. 

98. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:43. 

15 

99. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:44. 

• 100. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID' 
20 NO:45. 

» 

101. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:46. 

25 102. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:47. 

103. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:4S. 

30 

104. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:49. 

105. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
35 NO:50. 
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106. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ED 
NO:51. 

107. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
5 NO:52. 

108. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:53. 

10 109. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ DD 

NO:54. 

1 10. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:55. 

15 

1 1 1. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:56. 

1 12. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
20 NO:57. 

1 13. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:58. 

25 1 14. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:59. 

1 15. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:60. 

30 

1 16. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:61. 

1 17. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
35 NO:62. 
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1 18. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:63. 

1 19. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
5 NO:64. 
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<110> INCYTE GENOMICS, INC. 
LEE, Ernestine A. 
BAUGHN, Marian R. 
YUE, Henry 
DING, Li 

RAUMANN, Brigette E. 
HAFALIA, April J. A. 
KHAN, Farrah A. 
NGUYEN, Danniel B. 
ELLIOTT, Vicki S. 
RAMKUMAR , Jaya 1 axmi 
WALIA, Narinder K. 
I SON, Craig H. 
LU, Yan 

GANDHI, Ameena R. 
WARREN, Bridget A. 
DUGGAN, Brendan M. 
TRIBOULEY, Catherine M. 
BURFORD, Neil 
LU, Dyung Aina M. 
LAL, Preeti 
YAO, Monique G. 
XU, Yuming 

BRUNS, Christopher M. 
THANGAVELU, Kavitha 
SWARNAKAR, Anita 
TANG, Y. Tom 
AZIMZAI, Yalda 
THORNTON, Michael 
ARVIZU, Chandra 
POLICKY, Jennifer L. 

<120> TRANSPORTERS AND ION CHANNELS 

<130> PI-0314 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/254,303; 60/256,190; 60/257,504; 60/261,546; 60/262,832; 

60/264,377; 60/266,019 
<151> 2000-12-08; 2000-12-15; 2000-12-21; 2001-01-12; 2001-01-19; 

2001-01-26; 2000-02-02 

<160> 64 

<170> PERL Program 

<210> 1 

<211> 726 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7484831CD1 

<400> 1 

Met Glu Pro Gly Asp Ala Ala Arg Pro Gly Ser Gly Arg Ala Thr 

15 10 15 

Gly Ala Pro Pro Pro Arg Leu Leu Leu Leu Pro Leu Leu Leu Gly 

20 25 30 

Trp Gly Leu Arg Val Ala Ala Ala Ala Ser Ala Ser Ser Ser Gly 
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Ala 


Ala 


Ala 


Glu 


Asp Ser Ser 


Ala 










50 




Lys 


Glu 


Ala 


Glu 


Glu Ser His 


Arg 










65 


Thr 


Phe 


He 


Leu 


Leu Leu Thr 


Leu 










80 




Phe 


Lys 


His 


Arg 


Arg Val Arg 


Phe 










95 




Met 


He 


Tyr Gly Leu He Val 


Gly 










110 


Pro 


Ala 


Thr 


Ser Gly Arg Asp 


Lys 










125 




Asp 


Arg 


Ala 


Phe 


Ser Thr Leu 


Leu 










140 




Phe 


Glu 


Tyr 


Thr 


Leu Lys Gly 


Glu 










155 




Ser 


Val 


Glu 


Gin 


Asn Asp Met 


Leu 










170 




Glu 


Val 


Phe 


Phe 


Asn He Leu 


Leu 










185 




Gly 


Tyr 


Ser 


Leu 


Lys Lys Arg 


His 










200 




He 


Leu 


Ala 


Tyr 


Ala Phe Leu 


Gly 










215 


He 


Gly Asn Leu 


Met Tyr Gly Val 










230 




Gly Gin Leu 


Ser 


Asp Lys Phe 


Tyr 










245 




Gly Ala 


He 


He 


Ser Ala Thr 


Asp 










260 


Phe 


Asn 


Glu 


Leu 


His Ala Asp 


Val 










275 




Gly 


Glu 


Ser 


Val 


Leu Asn Asp 


Ala 










290 




Ser 


He 


Val 


Ala 


Tyr Gin Pro 


Ala 










305 




Asp Ala Ala 


Ala 


Phe Phe Lys 


Ser 










320 




Phe 


Ser Gly 


Ser 


Phe Thr Met 


Gly 










335 




Ala 


Leu 


Val 


Thr 


Lys Phe Thr 


Lys 










350 


Glu 


Thr 


Ala 


Leu 


Phe Phe Leu 


Met 










365 




Ala 


Glu 


Ala 


Cys 


Gly Phe Thr 


Gly 










380 


Gly 


He 


Thr 


Gin 


Ala His Tyr 


Thr 










395 




Ser 


Arg 


Ser 


Arg 


Thr Lys Gin 


Leu 










410 




Ala 


Glu 


Asn 


Phe 


He Phe Ser 


Tyr 










425 




Phe 


Gin 


Lys 


His 


Val Phe Ser 


Pro 










440 




Val 


Ala 


He 


Phe 


Leu Gly Arg 


Ala 










455 




Phe 


Phe 


Leu 


Asn 


Leu Gly Arg 


Arg 










470 




Gin 


His 


Met 


Met 


Met Phe Ser 


Gly 










485 




Ala 


Leu 


Ala 


He 


Arg Asp Thr 


Ala 










500 





40 










45 


Met Glu 


Glu 


Leu 


Ala 


Thr 


Glu 


55 










60 


Gin Asp 


Ser 


Val 


Ser 


Leu 


Leu 


70 










75 


Thr He 


Leu 


Thr 


He 


Trp 


Leu 


85 










90 


Leu His 


Glu 


Thr 


Gly 


Leu 


Ala 


100 










105 


Val He 


Leu 


Arg 


Tyr 


Gly Thr 


115 










120 


Ser Leu 


Ser 


Cys 


Thr 


Gin 


Glu 


130 










135 


Val Asn 


Val 


Ser 


Gly 


Lvs 


Phe 


145 










150 


He Ser 


Pro 


Gly 


Lys 


He 


Asn 


160 










165 


Arg Lys 


Val 


Thr 


Phe 


Aso 


Pro 


175 








180 


Pro Pro 


He 


He 


Phe 


His 


Ala 


190 










195 


Phe Phe 


Arg 


Asn 


Leu 


Gly 


Ser 


205 










210 


Thr Ala 


Val 


Ser 


Cys 


Phe 


He 


220 










225 


Val Lys 


Leu 


Met 


Lys 


He 


Met 


235 










240 


Tyr Thr 


Asp 


Cys 


Leu 


Phe 


Phe 


250 










255 


Pro Val 


Thr 


Val 


Leu 


Ala 


He 


265 










270 


Asp Leu 


Tyr 


Ala 


Leu 


Leu 


Phe 


280 










285 


Val Ala 


He 


Val 


Leu 


Ser 


Ser 


295 










300 


Gly Leu Asn 


Thr 


His 


Ala 


Phe 


310 










315 


Val Gly 


He 


Phe 


Leu 


Gly 


He 


325 










330 


Ala Val 


Thr 


Gly Val 


Val 


Thr 


340 










345 


Leu His 


Cys 


Phe 


Pro 


Leu 


Leu 


355 










360 


Ser Trp 


Ser 


Thr 


Phe 


Leu 


Leu 


370 










375 


Val Val 


Ala 


Val 


Leu 


Phe 


Cys 


385 










390 


Tyr Asn 


Asn 


Leu 


Ser 


Val 


Glu 


400 










405 


Phe Glu 


Val 


Leu 


His 


Phe 


Leu 


415 










420 


Met Gly 


Leu 


Ala 


Leu 


Phe 


Thr 


430 










435 


He Phe 


He 


He Gly Ala 


Phe 


445 










450 


Ala His 


He 


Tyr 


Pro 


Leu 


Ser 


460 










465 


His Lys 


He 


Gly Trp Asn 


Phe 


475 










480 


Leu Arg Gly Ala Met 


Ala 


Phe 


490 










495 


Ser Tyr 


Ala 


Arg 


Gin 


Met 


Met 


505 










510 
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Phe 


Thr 


Thr Thr 


Leu 


Leu 


He 


Val Phe Phe Thr Val Trp He 


He 








515 






520 


525 


Gly 


Gly 


Gly Thr 


Thr 


Pro 


Met 


Leu Ser Trp Leu Asn He Arg 


Val 








530 






535 


540 


Gly 


Val 


Glu Glu 


Pro 


Ser 


Glu 


Glu Asp Gin Asn Glu His His 


Trp 








545 






550 


555 


Gin 


Tyr 


Phe Arg 


Val 


Gly 


Val 


Asp Pro Asp Gin Asp Pro Pro 


Pro 








560 






565 


570 


Asn 


Asn 


Asp Ser 


Phe 


Gin 


Val 


Leu Gin Gly Asp Gly Pro Asp 


Ser 








575 






580 


585 


Ala 


Arg 


Gly Asn 


Arg 


Thr 


Lys 


Gin Glu Ser Thr Trp He Phe 


Arg 








590 






595 


600 


Leu 


Trp 


Tyr Ser 


Phe 


Asp 


His 


Asn Tyr Leu Lys Pro He Leu 


Thr 








605 






610 


615 


His 


Ser 


Gly Pro 


Pro 


Leu 


Thr 


Thr Thr Leu Pro Ala Trp Cys 


Gly 








620 






625 


630 


Leu 


Leu 


Ala Arg 


Cys 


Leu 


Thr 


Ser Pro Gin Val Tyr Asp Asn 


Gin 








635 






640 


645 


Glu 


Pro 


Leu Arg 


Glu 


Glu 


Asp 


Ser Asp Phe He Leu Thr Glu 


Gly 








650 






655 


660 


Asp 


Leu 


Thr Leu 


Thr 


Tyr 


Gly 


Asp Ser Thr Val Thr Ala Asn 


Glv 








665 






670 


675 


Ser 


Ser 


Ser Ser 


His 


Thr 


Ala 


Ser Thr Ser Leu Glu Gly Ser 


Arg 








660 






685 


690 


Arg 


Thr 


Lys Ser 


Ser 


Ser 


Glu 


Glu Val Leu Glu Arg Asp Leu 


Gly 








695 






700 


705 


Met 


Gly 


Asp Gin 


Lys 


Val 


Ser 


Ser Arg Gly Thr Arg Leu Val 


Phe 








710 






715 


720 


Pro 


Leu 


Glu Asp 


Asn 


Ala 














725 
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Met 


Glu 


Ser Ala 


Gly 


Leu Glu Gin 


1 






5 




Pro 


Asp 


Thr Glu 


Arg 
20 


He Arg Arg 


Val 


Leu 


Arg Ala 


Pro 
35 


Ala Ala Leu 


Ala 


Ser 


Ala Ala 


Asp 
50 


Pro Gin lie 


Thr 


Arg 


Arg Arg 


Leu 
65 


Asn Thr Arg 


Gin 


Arg 


Glu Ser 


Leu 
80 


Lys Ser Leu 


Glu 


Thr 


Glu His 


Cys 
95 


Val Ser Leu 


Thr 


lie 


Phe Arg 


Lys 
110 


Glu Gly Leu 


Gin 


Leu 


Leu Gin 


His 
125 


Ser Thr His 


Glu 


Met 


Gly Leu 


Leu 
140 


Leu Leu Ser 


Glu 


Ala 


Phe Gin 


Pro 
155 


His His Arg 



Leu 


Leu Arg Glu Leu Leu 


Leu 




10 




15 


Ala 


Thr 


Glu Gin Leu Gin 


He 




25 




30 


Pro 


Ala 


Leu Cys Asp Leu 


Leu 




40 




45 


Arg 


Gin 


Phe Ala Ala Val 


Leu 




55 




60 


Trp Arg Arg Leu Ala Ala 


Glu 




70 




75 


lie 


Leu 


Thr Ala Leu Gin 


Arg 




85 




90 


Ser 


Leu 


Ala Gin Leu Ser 


Ala 




100 




105 


Glu 


Ala 


Trp Pro Gin Leu 


Leu 




115 




120 


Ser 


Pro 


His Ser Pro Glu 


Arg 




130 




135 


Val 


Val 


Val Thr Ser Arg 


Pro 




145 




150 


Glu 


Leu 


Leu Arg Leu Leu 


Asn 




160 




165 
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Glu 


Thr 


Leu 


Leu 


Arg 


Thr 


Val 


Pro 


Leu 


Gin 


Thr 


Leu 


Glu 


Ala 


Leu 


Pro 


Tyr 


Leu 


Asn 


Val 


Ala 


Leu 


Thr 


Phe 


Arg 


Leu 


Leu 


Ala 


Glu 


Pro 


Glu 


Glu 


Glu 


His 


Phe 


Ala 


Pro 


Glu 


Lys 


Leu 


Arg 


Ser 


Leu 


Ala 


Val 


Leu 


Leu 


Pro 


Pro 


Ser 


Gin 


Phe 


Ser 


Glu 


Val 


Met 


Pro 


His 


Thr 


His 


Val 


Glu 


Asn 


Met 


Glu 


Cys 


Ala 


Lys 


Glu 


Ala 


Gin 


Ala 


Leu 


Arg 


Glu 


Gin 


He 


Gin 


Gly 


Glu 


Pro 


Leu 


Gly 


Leu 


Thr 


Tyr 


Ser 


Leu 


Ala 


Pro 


Leu 


Arg 


Ser 


Ser 


Phe 


Leu 



Gly 


Glu 


Val 




170 




Leu 


Thr 


Thr 




185 




Ala 


Arg 


Met 




200 




He 


Pro 


He 




215 




Asp 


Glu 


Leu 




230 




Ser 


Glu 


Val 




245 




Leu 


Gly 


Asn 




260 




Leu 


Val 


Lys 




275 




Pro 


Pro 


Leu 




290 




Pro 


Pro 


Gly 




305 




Glu 


Leu 


Glu 




320 




Val 


Gin 


Val 




335 




Leu 


Cys 


Pro 




350 




Glu 


Ser 


Pro 




365 




Leu 


Ser 


Asp 




380 




Pro 


Leu 


Leu 




395 




Val 


Val 


Arg 




410 




Asn 


Leu 


Gin 




425 




Leu 


Leu 


Leu 




440 




His 


Leu 


Ala 




455 




Leu 


Gly 


Pro 




470 




Met 


Leu 


Gin 




485 




Leu 


Ala 


Val 




500 




Ser 


Leu 


Leu 




515 




Phe 


Leu 


Leu 




530 




Ser 


Leu 


Glu 




545 




Met 


Arg 


Pro 




560 




Cys 


Asp 


Gin 




575 




Leu 


Phe 


Ala 




590 




His 


Leu 


Glu 




605 




Thr 


Glu 


Gly 




620 




Leu 


Phe 


Asp 



Gly 


Ser 


Pro 


Met 


Ala 


Pro 


Leu 


Val 


Pro 


Asp 


Glu 


Ala 


Leu 


Glu 


Ser 


Leu 


Thr 


Phe 


Ala 


He 


Arg 


Val 


Lys 


Ser 


Leu 


His 


Thr 


Gin 


Leu 


Asp 


He 


Glu 


Leu 


Val 


Asp 


Met 


Gin 


Leu 


Met 


Tyr 


Gin 


Arg 


Gly 


Ala 


Gly 


Gin 


He 


Val 


Asn 


Ala 


Ala 


Pro 


His 


He 


Ala 


Tyr 


Leu 


Lys 


Ala 


Cys 


Lys 


Val 


Gin 


Leu 


Leu 


Arg 


Ser 


Ala 


Leu 


Pro 


Tyr 


Phe 


Thr 


Gly 


Arg 


Thr 


Leu 


Gly 


Leu 


Ala 


Glu 


Val 


Asp 


Asp 


Ala 


Leu 


Ser 


Gin 


He 


Thr 


He 


Val 


Pro 


Asp 


Glu 


Ser 



Gly 


Leu 


Leu 


175 






Tyr 


Leu 


Ser 


190 






Lys 


Leu 


He 


205 






Lys 


Ala 


Cys 


220 






Glu 


Val 


Pro 


235 






Cys 


Leu 


Glu 


250 






He 


Arg 


He 


265 






Lys 


Ala 


Leu 


280 






Leu 


Phe 


Pro 


295 






Pro 


Glu 


Asp 


310 






Met 


Gly 


Glu 


325 






Leu 


Ala 


Leu 


340 






Pro 


Met 


Leu 


355 






Lvs 


Ala 


Glv 


370 






Asp 


His 


He 


385 






Cys 


Lys 


Gly 


400 






Leu 


Phe 


Ala 


415 






Ser 


Ser 


Tvr 


430 






Lvs 


Ser 


Val 


445 






Tyr 


Ala 


Leu 


460 






Pro 


Tvr 


Leu 


475 






Asn 


Pro 


Ser 


490 






Gly 


Ala 


He 


505 






Pro 


Ala 


He 


520 






Glu 


Asp 


Leu 


535 






Val 


Leu 


Ala 


550 






Glu 


Cvs 


Cys 


565 






Ala 


Asp 


Leu 


580 






Gly 


Leu 


Met 


595 






Thr 


Leu 


Met 


610 






Gin 


Tyr 


Asp 


625 






Asp 


Gly 


Glu 
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Phe 


Tyr 


Ser 






180 


Thr 


Glu 


Asd 






195 


Met 


Ala 


Met 






210 


Glu 


Ala 


Leu 






225 


Val 


He 


Thr 






240 


Val 


Ala 


Arg 






255 


Leu 


Cvs 

^ Jr " 


Cvs 

v- jr i=> 






270 


Leu 


Lvs 


Asn 






285 


He 


Val 


Ala 






300 


Gin 


Asp 


Ser 






315 


Thr 


Pro 


Lys 






330 


His 


Leu 


Pro 






345 


Glu 


Glu 


Ala 






360 


Leu 


Leu 


Val 






375 


Ara 


Gin 


Arg 






390 


Leu 


Glu 


Asp 






405 


Leu 


Glv 

Jr 


Gin 






420 


Ser 


Arg 


Glu 






435 


Pro 


Leu 


Glv 






450 


Glu 


Asn 


Phe 






465 


Pro 


Glu 


Leu 






480 


Ser 


Pro 








495 


Ala 


Thr 


Ala 






510 


Met 


Glu 


His 






525 


Gin 


Pro 


Val 






540 


Arg 


Ala 


Val 






555 


Gin 


Leu 


Glv 






570 


Ara 


Arg 


Cvs 






585 


Glv 


Glu 


Glv 

UA Jr 






600 


Leu 


Leu 


Ser 






615 


Gly 


Ser 


Ser 






630 


Glu 


Glu 


Glu 
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635 






Glu 


Leu 


Met 


Asp 


Glu 


Asp Val 


Glu 










650 






Ser 


Glv 


Tyr 


Ser 


Val 


Glu Asn 


Ala 










665 






Thr 


Cvs 


Ala 


Ala 


Val 


Gly Glu 


He 










680 






Phe 


Leu 


Pro 


Tyr 


Met 


Glu Ser 


Val 










695 






Leu 


Glu 


Cvs 


Pro 


His 


Leu Asn 


Val 










710 






Leu 


Glv 


Gin 


Phe 


Cvs 

Jr a 


Cys Ala 


Leu 










725 






Pro 


Ser 


Glu 


Pro 


Asn 


Thr Ala 


Ala 










740 






Val 


Val 


Pro 


Ser 


Tvr 

X Jf X. 


Met Gin 


Ala 










755 






Gin 


Val 


Val 


Met 


Ala 


Val Leu 


Glu 










770 






Ser 


Cvs 


Glv 


Thr 


Leu 


Thr Leu 


Lys 










785 






Leu 


Cys 


Glv 


Val 


Leu 


Lys Ala Val 










800 






Gin 




Thr 


Asp 


Glu 


Glu Glu 


Glu 










815 






Glu 


Tvr 

* Jr * 


Asp 


Ala 


Met 


Leu Leu 


Glu 










830 






Ala 


Leu 


Ala 


Ala 


Ala 


Ala Gly Gly 










845 






Ala 


Glv 


Phe 


Leu 


Pro 


Leu Leu 


Val 










860 






Thr 


Val 


Ala 


Glu 


Lys 


Ser Phe 


Ala 










875 






lie 


Gin 


Glv 


Leu 


Glv 


Ala Ala 


Ser 










890 






Leu 


Pro 


Val 


Leu 


Leu 


Ser Thr 


Ala 










905 






Arcr 


Ser 


Asn 


Ala 


He 


Phe Gly Met 










920 






Glv 


His 


Pro 


Ala 


Gin 


Glu His 


Phe 










935 






Phe 


Pro 


Leu 


Leu 


Ala 


Arg Glu Arg 










950 






He 


Cys 


Glv 


Ala 


Leu 


Ala Arg Leu 










965 






Lvs 


Pro 


Glu 


Pro 


Gin 


Val Leu 


Ala 










980 






Leu 


Lvs 


Glu 


Asp 


Leu 


Glu Glu 


Trp 










995 






Ser 


Phe 


Leu 


Tyr 


Gin 


Ser Ser 


Pro 








1010 






Pro 


Glu 


Leu 


Leu 


Arg 


He Cys 


Ser 








1025 






He 


Pro 


Pro 


Asp 


Thr 


Lys Ala 


Ala 








1040 






Leu 


Ala 


Lys 


Gin 


His 


Thr Asp 


Ser 








1055 






Leu 


Pro 


Val 


Asp 


Lys 


Ala Gin 


Glu 








1070 






Ser 
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640 








645 


Glu 


Glu 


Asp 


Asp 


Ser 


Glu He 




655 








660 


Phe 


Phe 


Asp 


Glu 


Lvs 


Glu Asp 




670 








675 


Ser 


Val 


Asn 


Thr 


Ser 


Val Ala 




685 








690 


Phe 


Glu 


Glu 


Val 


Phe 


Lys Leu 




700 








705 


Arg 


LVS 


Ala 


Ala 


His 


Glu Ala 




715 








720 


His 


LVS 

■"Jr 


Ala 


Cys 


Gin 


Ser Cys 




730 








735 


Leu 


Gin 


Ala 


Ala 


Leu 


Ala Arg 




745 








750 


Val 


Asn 




Glu 


21 V<T 


Gl n 7±m 




760 










Ala 


Leu 


Thr 


(11 V 


Va I 

val 


T ,CO 1 21 TTX 




775 








7R0 


Pro 


Pro 


Glv 




Leu 


Ala Glu 




790 








795 


Leu 


Gin 




Lys 


Thr 






805 








810 


Glu 


Glu 


Asp 


Asp 


Asp 


Gin Ala 




820 








825 


His 


Ala 


Glv 


Glu 


Ala 


He Pro 




835 








840 


Asp 


Ser 


Phe 


Ala 


Pro 


Phe Phe 




850 








855 


Cys 


Lys 


Thr 


Lys 


Gin 


Gly Cys 




865 








870 


Val 


Glv 


Thr 


Leu 


Ala 


Glu Thr 




880 








885 


Ala 


Gin 


Phe 


Val 


Ser 


Arg Leu 




895 








900 

j \j \j 


Gin 


Glu 


Ala 


7v en 


Prn 

XT X \J 


Glu Val 
ox u val 




910 








915 

-7 X -J 


Glv 


Val 


Leu 


Ala 


Glu 


His Glv 




925 








930 


Pro 


Lys 


Leu 


Leu 


Glv 


JJCU ucu 




940 








945 


His 


Asp 


Arg 


Val 


Arg 


Sen Asn 
no^/ n oil 




955 








960 


Leu 


Met 


Ala 


Ser 


Pro 


Thr Arg 




970 








975 


Ala 


Leu 


Leu 


His 


Ala 


Leu Pro 




985 








990 


Val 


Thr 


He 


Glv 


Arcr 


Leu Phe 


1000 








1005 


Asp 


Gin 


Val 


He 


Asd 


Val Ala 


1015 








1020 


Leu 


He 


Leu 


Ala 


Asp 


Asn Lys 


1030 








1035 


Leu 


Leu 


Leu 


Leu 


Leu 


Thr Phe 


1045 








1050 


Phe 


Gin 


Ala 


Ala 


Leu 


Gly Ser 


1060 








1065 


Leu 


Gin 


Ala 


Val 


Leu 


Gly Leu 


1075 








1080 
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<400> 3 



Met 


Tyr 


Val 


Arg Val 


Ser 


Phe 


Asp 


1 








5 






His 


Leu 


Met 


Thr 


Lys 


Glu 


Trp 


Gin 










20 








He 


Ser 


Val 


His 


Gly 


Gly 


Leu 


Gin 










35 








Leu 


Lys 


Gin 


Val 


Phe Gly Lys 


Gly 










50 








Thr 


Gly Ala Trp lie 


Phe 


Thr 


Gly 










65 






Arg 


His 


Val 


Gly Asp Ala Leu 


Lys 










80 








Gly 


Lys 


He 


Cys 


Thr 


He Gly 


He 










95 








Asn 


Gin Glu Asp Leu 


He Gly Arg 










110 








Thr 


Met 


Ser 


Asn 


Pro 


Met 


Ser 


Lys 










125 






His 


Ser 


His 


Phe 


He 


Leu 


Ala 


Asp 










140 






Gly Ala 


Glu 


Val 


Lys 


Leu 


Arg 


Arg 










155 








Leu 


Gin 


Lys 


He 


Asn 


Thr 


Arg 


He 










170 








Ala 


Leu 


He 


Val 


Glu 


Gly Gly 


Pro 










185 








Glu 


Tyr 


Leu 


Arg 


Asp 


Thr 


Pro 


Pro 










200 








Gly 


Ser 


Gly Arg Ala 


Ser 


Asp 


He 










215 








Ser 


Glu 


Glu Gly Gly Leu 


He 


Asn 










230 








Leu 


Val 


Thr 


He 


Gin 


Lys 


Thr 


Phe 










245 








Gin 


His 


Leu 


Phe 


He 


He 


Leu 


Met 










260 








Leu 


He 


Thr 


Val 


Phe 


Arg 


Met 


Gly 










275 






Asp 


Leu 


Ala 


He 


Leu 


Thr 


Ala 


Leu 










290 








Ala 


Pro 


Asp 


Gin 


Leu 


Ser 


Leu 


Ala 










305 








He 


Ala 


Arg 


Ser 


Gin 


He 


Phe 


He 










320 








Gly 


Ser 


Leu 


Glu 


Gin 


Ala 


Met 


Leu 










335 








Val 


Asp 


Phe 


Val 


Lys 


Leu 


Leu 


He 










350 








Arg 


Phe 


Leu 


Thr 


He 


Ser Arg 


Leu 










365 








His 


Gly 


Pro 


Ser 


Asn 


Thr 


Leu 


Tyr 










380 








Lys 


Gly Asn Leu 


Pro 


Pro Asp 


Tyr 










395 









Thr Lys 


Pro 


Asp 


Leu 


Leu 


Leu 


10 










15 


Leu Glu 


Leu 


Pro 


Lys 


Leu 


Leu 


25 










30 


Asn * Phe 


Glu 


Leu 


Gin 


Pro 


Lys 


40 










45 


Leu He 


Lys 


Ala 


Ala 


Met 


Thr 


55 










60 


Gly Val 


Asn 


Thr 


Gly Val 


He 


70 










75 


Asp His 


Ala 


Ser 


Lys 


Ser 


Arg 


85 










90 


Ala Pro 


Trp 


Gly 


He 


Val 


Glu 


100 










105 


Asp Val 


Val 


Arg 


Pro 


Tyr 


Gin 


115 










120 


Leu Thr 


Val 


Leu 


Asn 


Ser 


Met 


130 










135 


Asn Gly 


Thr 


Thr 


Gly 


Lys 


Tyr 


145 










150 


Gin Leu 


Glu 


Lys 


His 


He 


Ser 


160 










165 


Gly Gin 


Gly Val 


Pro 


Val 


Val 


175 










180 


Asn Val 


He 


Ser 


lie 


Val 


Leu 


190 










195 


Val Pro 


Val 


Val 


Val 


Cys 


Asp 


205 










210 


Leu Ala 


Phe 


Gly His 


Lys 


Tyr 


220 










225 


Glu Ser 


Leu 


Arg 


Asp 


Gin 


Leu 


235 










240 


Thr Tyr 


Thr 


Arg 


Thr 


Gin 


Ala 


250 










255 


Glu Cys 


Met 


Lys 


Lys 


Lys 


Glu 


265 










270 


Ser Glu 


Gly His 


Gin Asp 


He 


280 










285 


Leu Lys 


Gly Ala 


Asn 


Ala 


Ser 


295 










300 


Leu Ala 


Trp 


Asn 


Arg 


Val 


Asp 


310 










315 


Tyr Gly Gin Gin Trp Pro Val 


325 










330 


Asp Ala 


Leu 


Val 


Leu 


Asp 


Arg 


340 










345 


Glu Asn 


Gly Val 


Ser 


Met 


His 


355 










360 


Glu Glu 


Leu 


Tyr 


Asn 


Thr 


Arg 


370 










375 


His Leu 


Val 


Arg Asp Val 


Lys 


385 










390 


Arg He 


Ser 


Leu 


He 


Asp 


He 


400 










405 
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Gly Leu 


Val 


lie Glu Tyr Leu Met 


Gly Gly Ala Tyr 


Arg 


Cys 


Asn 










410 


415 






420 


Tyr 


Thr 


Arg Lys Arg Phe Arg Thr 


Leu Tyr His Asn 


Leu 


Phe 


Gly 










425 


430 






435 


Pro 


Lys 


Arg 


Pro 


Lys Ala Leu Lys 


Leu Leu Gly Met 


Glu 


Asp 


Asp 










440 


445 






450 


He 


Pro 


Leu Arg Arg Gly Arg Lys 


Thr Thr Lys Lys 


Arg 


Glu 


Glu 










455 


460 






465 


Glu 


Val 


Asp He Asp Leu Asp Asp 


Pro Glu lie Asn 


His 


Phe 


Pro 










470 


475 






480 


Phe 


Pro 


Phe 


His 


Glu Leu Met Val 


Trp Ala Val Leu 


Met 


Lys 


Arcr 










485 


490 




495 


Gin 


Lys 


Met 


Ala 


Leu Phe Phe Trp 


Gin His Gly Glu 


Glu 


Ala 


Met 










500 


505 






510 


Ala 


Lys 


Ala 


Leu 


Val Ala Cys Lys 


Leu Cys Lys Ala 


Met 


Ala 


His 










515 


520 






525 


Glu 


Ala 


Ser 


Glu 


Asn Asp Met Val 


Asp Asp lie Ser 


Gin 


Glu 


Leu 










530 


535 






540 


Asn 


His 


Asn 


Ser 


Arg Asp Phe Gly 


Gin Leu Ala Val 


Glu 


Leu 


Leu 










545 


550 






555 


Asp 


Gin 


Ser Tyr Lys Gin Asp Glu 


Gin Leu Ala Met 


Lvs 


Leu 


Leu 










560 


565 




570 


Thr 


Tvr 


Glu 


Leu 


Lys Asn Trp Ser 


Asn Ala Thr Cys 


Leu 


Gin 


Leu 










575 


580 






585 


Ala 


Val 


Ala Ala Lys His Arg Asp 


Phe lie Ala His 


Thr 


Cvs 


Ser 










590 


595 






600 


Gin 


Met 


Leu 


Leu 


Thr Asp Met Trp 


Met Gly Arg Leu 


Arcr 


Met 


Arcr 










605 


610 






615 


Lys 


Asn 


Ser Gly Leu Lys Val He 


Leu Gly lie Leu 


Leu 


Pro 


Pro 










620 


625 






630 


Ser 


He 


Leu 


Ser 


Leu Glu Phe Lys 


Asn Lys Asp Asp 


Met 


Pro 


Tvr 










635 


640 






645 


Met 


Ser 


Gin 


Ala 


Gin Glu lie His 


Leu Gin Glu Lys 


Glu 


Ala 


Glu 










650 


655 






660 


Glu 


Pro 


Glu 


Lys 


Pro Thr Lys Glu 


Lys Glu Glu Glu 


Asp 


Met 


Glu 










665 


670 




675 


Leu 


He 


Ala 


Met 


Leu Gly Arg Asn 


Asn Gly Glu Ser 


Ser 


Arcr 


Lys 










680 


685 






690 


Lys 


Asp 


Glu 


Glu 


Glu Val Gin Ser 


Glu His Arg Leu 


lie 


Pro 


Leu 










695 


700 






705 


Gly Arg 


Lys 


He 


Tyr Glu Phe Tyr 


Asn Ala Pro lie 


Val 


Lys 


Phe 










710 


715 






720 


Trp 


Phe 


Tyr 


Thr 


Leu Ala Tyr lie 


Gly Tyr Leu Met 


Leu 


Phe 


Asn 










725 


730 






735 


Tyr 


He 


Val 


Leu 


Val Lys Met Glu 


Arg Trp Pro Pro 


Thr 


Gin 


Glu 










740 


745 






750 


Trp 


He 


Val 


He 


Ser Tyr lie Phe 


Thr Leu Gly lie 


Glu 


Lvs 


Met 










755 


760 




765 


Arg 


Glu 


He 


Leu 


Met Ser Glu Pro 


Gly Lys Leu Leu 


Gin 


Lys 


Val 










770 


775 






780 


Lys Val 


Trp Leu 


Gin Glu His Trp 


Asn Val Thr Asp 


Leu 


He 


Ala 










785 


790 






795 


He 


Leu 


Leu Phe Ser Val Gly Met 


lie Leu Arg Leu 


Gin 


Asp 


Gin 










800 


805 




810 


Pro 


Phe 


Arg Ser Asp Gly Arg Val 


lie Tyr Cys Val 


Asn 


He 


He 










815 


820 






825 


Tyr Trp 


Tyr He Arg Leu Leu Asp 


lie Phe Gly Val 


Asn 


Lys 


Tyr 










830 


835 






840 


Leu Gly 


Pro Tyr Val Met Met He 


Gly Lys Met Met 


He 


Asp 


Met 










845 


850 




855 


Met 


Tyr 


Phe 


Val 


lie He Met Leu 


Val Val Leu Met 


Ser 


Phe 


Gly 










860 


865 






870 


Val 


Ala 


Arg Gin Ala lie Leu Phe 


Pro Asn Glu Glu 


Pro 


Ser 


Trp 
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875 










880 








885 


Lys 


Leu 


Ala 


Lys 


Asn 


He 


Phe 


Tyr 


Met 


Pro 


Tyr 


Trp 


Met 


lie Tyr 










890 










895 








900 


Gly 


Glu 


Val 


Phe 


Ala 


Asp 


Gin 


He 


Asp 


Pro 


Pro 


Cys 


Gly Gin Asn 










905 










910 








915 


Glu 


Thr 


Arg 


Glu 


Asp 


Gly 


Lys 


He 


lie 


Gin 


Leu 


Pro 


Pro 


Cys Lys 










920 










925 








930 


Thr 


Gly 


Ala 


Trp 


He 


Val 


Pro 


Ala 


lie 


Met 


Ala 


Cys 


Tyr 


Leu Leu 










935 










940 




945 


Val 


Ala 


Asn 


He 


Leu 


Leu 


Val 


Asn 


Leu 


Leu 


He 


Ala 


Val 


Phe Asn 










950 










955 








960 


Asn 


Thr 


Phe 


Phe 


Glu 


Val 


Lys 


Ser 


He 


Ser 


Asn 


Gin 


Val 


Trp Lys 










965 










970 








975 


Phe 


Gin 


Arg 


Tyr 


Gin 


Leu 


He 


Met 


Thr 


Phe 


His 


Glu 


Arg 


Pro Val 










980 










985 








990 


Leu 


Pro 


Pro 


Pro 


Leu 


He 


He 


Phe 


Ser 


His 


Met 


Thr 


Met 


He Phe 










995 








1000 








1005 


Gin 


His 


Leu 


Cys 


Cys 


Arg 


Trp 


Arg 


Lys 


His 


Glu 


Ser 


Asp 


Pro Asp 








1010 








1015 








1020 


Glu 


Arg 


Asp 


Tyr 


Gly 


Leu 


Lys 


Leu 


Phe 


He 


Thr 


Asp Asp 


Glu Leu 








1025 








1030 








1035 


Lys 


Lys 


Val 


His 


Asp 


Phe 


Glu 


Glu 


Gin 


Cys 


lie 


Glu 


Glu 


Tyr Phe 








1040 








1045 








1050 


Arg 


Glu 


Lys 


Asp 


Asp 


Arg 


Phe 


Asn 


Ser 


Ser 


Asn 


Asp 


Glu 


Arg lie 








1055 








1060 






1065 


Arg 


Val 


Thr 


Ser 


Glu 


Arg 


Val 


Glu 


Asn 


Met 


Ser 


Met 


Arg 


Leu Glu 








1070 








1075 






1080 


Glu 


Val 


Asn 


Glu 


Axg 


Glu 


His 


Ser 


Met 


Lys 


Ala 


Ser 


Leu 


Gin Thr 








1085 








1090 








1095 


Val 


Asp 


He 


Arg 


Leu 


Ala 


Gin 


Leu 


Glu 


Asp 


Leu 


He 


Gly 


Arg Met 








1100 








1105 






1110 


Ala 


Thr 


Ala 


Leu 


Glu 


Arg 


Leu 


Thr 


Gly 


Leu 


Glu 


Arg 


Ala 


Glu Ser 








1115 








1120 








1125 


Asn 


Lys 


He 


Arg 


Ser 


Arg 


Thr 


Ser 


Ser 


Asp 


Cys 


Thr 


Asp 


Ala Ala 








1130 








1135 








1140 


Tyr 


He 


Val 


Arg 


Gin 


Ser 


Ser 


Phe 


Asn 


Ser 


Gin 


Glu 


Gly Asn Thr 








1145 








1150 








1155 


Phe 


Lys 


Leu 


Gin 


Glu 


Ser 


lie 


Asp 


Pro 


Ala 


Glu 


His 


Pro 


Leu Tyr 








1160 








1165 








1170 


Ser 


Val 
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Met 


Glu 


lie 


Ala 


Gly 


Glu Lys 


Lys 


1 








5 






Ser 


Ser 


Leu 


Leu 


Arg 


Ser Leu 


Ala 










20 






Val 


Gin 


Arg 


Gly 


Ala 


Leu Leu 


Glu 










35 






Ala 


Thr 


Asn 


Val 


Arg 


Asn Asp 


Gin 










50 






Ser 


He 


Val 


Val 


Ser 


Gly Val 


Ser 










65 






Lys 


Tyr 


Gly Gin 


Phe 


Leu Leu 


Val 



Tyr 


Gin 


Tyr 


Tyr 


Ser 


Met 


Pro 




10 










15 


Gly Gly 


Arg 


His 


Pro 


Pro 


Arg 




25 










30 


He 


Thr 


Asn 


Ser 


Lys 


Arg 


Glu 




40 










45 


Glu 


Arg 


Gin 


Glu 


Thr 


Gin 


Ser 




55 










60 


Pro 


Asn 


Arg 


Gin 


Ala 


His 


Ser 




70 










75 


Pro 


Ser 


Asn 


Leu 


Lys 


Arg 


Val 
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80 


Pro 


Phe 


Ser 


Ala 


Glu 










95 


Pro 


Asp Val 


Leu 


Ala 










110 


Val 


His 


Leu 


Thr 


Gly 










125 


Val 


Asp 


Glu 


Pro 


Lys 










140 


Asp 


Gly Gin 


Asp 


Lys 










155 


Thr 


Pro 


Glu 


Pro 


Met 










170 


He 


Met 


Gin 


Thr Gly 










185 


Gly 


Asp Glu 


Thr 


Gly 










200 


Cys 


Leu 


Pro 


Ser 


Thr 










215 


His 


Gin 


Lys 


Gin 


Ala 










230 


Arg 


Ser 


Ala 


Gin 


Ser 










245 


Gin 


Pro 


Asn 


Thr 


Ser 










260 


Pro 


Phe 


Ala 


Arg 


Ser 










275 


Leu 


Met 


Lys 


He 


Cys 










290 


He 


Gly Gin 


Gly Ala 










305 


Arg 


Arg 


Leu 


Ser 


Lys 










320 


Pro 


He 


He 


Thr 


Val 










335 


Pro 


Glu 


Pro 


Gly Glu 










350 


Asn 


Trp 


Phe 


Thr 


Val 










365 


Tyr 


He 


His 


Gin 


Lys 










380 


He 


Gin 


He 


Ser 


Asp 










395 


Asp 


Leu 


Ser 


Lys 


Asn 










410 


Asn 


Arg 


Asn 


Ser 


Leu 










425 


Leu 


Lys 


Ser 


Pro 


Gin 










440 


Leu 


Phe 


Ala 


Ala Gly 










455 


Ser 


Ser 


Pro 


Leu 


Asn 










470 


Arg 


Gly Arg 


Ala 


Ala 










485 


Asp 


Gly Gly 


Leu 


Arg 










500 


Glu 


Asp Asp 


He 


Leu 










515 


Phe 


Leu 


Pro 


Glu Val 










530 


Ser 


Thr 


Val 


Leu 


His 










545 











85 


Thr 


Val 


Leu 


Pro 


Leu 
100 


Thr 


Glu 


Gin 


Asp 


He 
115 


Gin 


Thr 


Pro 


Val 


Val 
130 


Glu 


Lys 


His 


Glu 


Pro 
145 


Ala 


Glu 


Gin 


Val 


Leu 
160 


Thr 


Met 


Ser 


Ser 


Pro 
175 


Gly 


Gly 


Tyr 


Val 


Asn 
190 


Val 


Phe 


Ser 


He 


Lys 
205 


Cvs 


ASD 


Ser 


Asp 


Ser 
220 


Gin 


Asp 


Ser 


Ser 


Leu 
235 


Ser 


Glu 


Cvs 


Ser 


Glu 
250 


Phe 


Trp 


He 


Asn 


Pro 
265 


His 


Ser 


Phe 


Arc 


Phe 
280 


Lys 


He 


Lvs 


Asn 


Leu 
295 


Trp 


Val 


LVS 


Ala 


Lvs 
310 


Lvs 


Lvs 


Lvs 


Asn 


Thr 
325 


Asn 


Ala 


Cvs 
v» j «j 


Ser 


Gin 
340 


Asn 


Ser 


He 


Ser 


Glu 
355 


Ser 


Lvs 


Phe 


Ser 


His 
370 


Met 


Lvs 


Thr 


Lys 


Glu 
385 


Tyr 


Leu 


Lvs 


Gin 


Ser 
400 


Ser 


Leu 


Trp 


Asn 


Ser 
415 


Ser 


Ser 


Leu 


He 


Ser 
430 


Glu 


Pro 


His 


His 


His 
445 


Glu 


Glu 


He 


Thr 


Val 
460 


Leu 


Asp 


Lys 


Ser 


Met 
475 


Met 


He 


Gin 


Val 


Leu 
490 


Lys 


Ala 


Met 


Arg Val 










505 


Lys 


Pro 


Gly 


Gin 


Val 
520 


Val 


Arg 


Thr 


Trp His 










535 


Leu 


Cys 


Leu 


Arg 


Glu 
550 
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90 


Ser 


Arg 


Pro 


Ser 


Val 
105 


Gin 


Thr 


Glu 


Val 


Leu 
120 


Ser 


Asp 


Trp 


Ala 


Ser 
135 


He 


Ala 


His 


Leu 


Leu 
150 


Pro 


Thr 


Leu 


Ser 


Cys 
165 


Leu 


Ser 


Gin 


Ala 


Lys 
180 


TrD 


Ala 


Phe 


Ser 


Glu 
195 


Lys 


Lys 


Tits 
lip 


Gin 


Thr 

X. 111. 

210 


Ser 




Ser 


Glu 


Gin 
225 


Ser 


Asp 


Asn 


Ser 


Thr 
240 


Val 


Glv 


Pro 


Tin 


Leu 
255 


Leu 


Arg 




Tvr 


Arg 
270 


His 


Lys 


Glu 


Glu 


Lys 
285 


Ser 


Glv 


Ser 


Ser 


Glu 
300 


Met 


Leu 


Thr 


Lys 


Asp 
315 


Gin 


Glv 


Leu 


Gin 


Val 
330 


Ser 


Asp 


Gin 


Leu 


Asn 
345 


Glu 


Glu 






T ,\rcz 
JJjf o 

360 


Thr 


Glv 


Val 


Glu 


Pro 
375 


He 


Glv 


Gin 


Cys 


Ala 
390 


Gin 


Glu 


Ser 


Ala 


Gin 
405 


Arg 


Ser 


Thr 


Asn 


Leu 
420 


Glu 


He 


Ser 


Ala 


Ser 
435 


Tvr 


Ser 


Pro 


Ser 


Leu 
450 


Tvr 


Arc 


Leu 


Glu 


Glu 
465 


Ser 


Ser 


TrD 


Ser 


Gin 
480 


Ser 


Arg 


Glu 


Glu 


Met 
495 


Val 


Ser 


Thr 


Trp 


Ser 
510 


Phe 


He 


Val 


Lys 


Ser 
525 


Lys 


lie 


Phe 


Gin 


Glu 
540 


He 


Gin 


Gin 


Gin 


Arg 
555 
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Ala 


Ala 


Gin 


Lys 


Leu 


He 


Tyr 










560 




Thr 


He 


Pro 


Tyr 


Thr 
575 


Pro 


Arg 


Cys 


His 


Ser 


Ala 


Asn 


Gin 


Trp 










590 




Gly Glu 


Phe 


Arg 


Lys 


Tyr 


Asn 










605 






Pro 


Thr 


Asn 


Thr 


Leu 
620 


Glu 


Glu 


Thr 


Tyr 


Glu 


Tyr 


Thr 
635 


Arg 


Gly 


Gly Val 


Gly 


Glu 


Asn 


Leu 


Thr 










650 






Val 


Lys 


Gin 


Ser 


Arg 
665 


Gly 


Met 


Glu 


Asp 


Ala 


He 


Arg 
680 


Asn 


Phe 


Cys 


Cys 


Arg 


Lys 


Leu 
695 


Lys 


Leu 


Ser 


Pro 


Glu 


Arg 


He 
710 


Asn 


Ser 


Glu 


Ser 


Ala 


Glu 


Glu 
725 


Pro 


Pro 


Pro 


Glu 


Asp Asp 


Met 


Gin 


Leu 










740 







Thr 


Phe 


Asn 


Gin 


Val 


Lys 


Pro Gin 






565 










570 


Phe 


Leu 


Glu 


Val 


Phe 


Leu 


He 


Tyr 






580 










585 


Leu 


Thr 


He 


Glu 


Lys 


Tyr 


Met 


Thr 






595 










600 


Asn 


Asn 


Asn 


Gly Asp Glu 


He 


Thr 






610 










615 


Leu 


Met 


Leu 


Ala 


Phe 


Ser 


His 


Trp 






625 










630 


Glu 


Leu 


Leu 


Val 


Leu 


Asp 


Leu 


Gin 






640 










645 


Asp 


Pro 


Ser 


Val 


He 


Lys 


Pro 


Glu 






655 










660 


Val 


Phe 


Gly 


Pro 


Ala 


Asn 


Leu 


Glv 






670 










675 


He 


Ala 


Lys 


His 


His 


Trp Asn 


Ser 






685 










690 


Pro 


Asp 


Leu 


Lys 


Arg 


Asn 


Asp 


Tyr 






700 










705 


Thr 


Phe 


Gly 


Leu 


Glu 


He 


Lys 


He 






715 










720 


Ala 


Arg Glu Thr Gly Arg Asn 


Ser 






730 










735 
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Met 


Gly 


Ser 


1 






Phe 


Gly 


Arg 


Asn 


He 


Ser 


Val 


Thr 


Pro 


Val 


Val 


Phe 


Ala 


Leu 


Leu 


Gin 


Asn 


Gly 


Arg 


Glu 


Asn 


Lys 


Glu 


Phe 


Trp 


Lys 


Ser 


Lys 


Trp 


Leu 


Leu 


Leu 


Gly 


Arg 


Arg 


Val 



Arg 


His 


Phe 




5 




Phe 


Gin 


Arg 




20 




Cys 


Gly 


He 




35 




His 


His 


Val 




50 




His 


Asn 


His 




65 




Ser 


Ser 


Gly 




80 




Glu 


He 


Trp 




95 




Thr 


Ser 


Ser 




110 




Pro 


Cys 


Val 




125 




Thr 


Ala 


Val 




140 




Ala 


Met 


Leu 




155 




Ser 


Val 


Thr 




170 




Val 


Leu 


Trp 




185 





Glu Gly He 
Val Leu Tyr 
His Tyr Leu 
Cys Arg Pro 
Ser Asn Trp 
Gin Lys Asp 
Glu Leu Ser 
Leu Gly Tyr 
Asp Gly Tyr 
Thr Gin Trp 
He Gin Pro 
Phe Gly Tyr 
Ala Thr Ser 



Tyr Asp His 
10 

Phe He Cys 
25 

Ala Ser Val 
40 

Pro Gly Asn 
55 

Ser Leu Glu 
70 

Tyr Val Thr 
85 

Arg Cys Ser 
100 

Glu Tyr Thr 
115 

lie Tyr Asp 
130 

Asn Leu Val 
145 

Leu Phe Met 
160 

Phe Ser Asp 
175 

Ser Ser Met 
190 



Val Gly His 
15 

Ala Phe Gin 
30 

Phe Met Gly 
45 

Val Ser Gin 
60 

Asp Thr Gly 
75 

Val Gin Leu 
90 

Arg Asn Lys 
105 

Gly Ser Lys 
120 

Gin Asn Thr 
135 

Cys Asp Arg 
150 

Phe Gly Val 
165 

Arg Leu Gly 
180 

Phe Leu Phe 
195 
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Gly 


He 


Ala 


Ala 


Ala 


Phe 


Ala 


Val 








200 








Ala 


Arg 


Phe 


Phe 


Leu 
215 


Ala 


Met 


Val 


Gly 


Phe 


Val 


Tyr 


Val 


Met 


Glu 


Phe 








230 








Trp 


Ala 


Ser 


Val 


His 


Leu 


His 


Ser 








245 








Leu 


Val 


Ala 


Leu 


Thr 
260 


Gly 


Tyr 


Leu 


Gin 


Met 


He 


Leu 


Ser 
275 


Thr 


Val 


Thr 


Trp 


Val 


Leu 


Pro 


Glu 
290 


Thr 


Pro 


Phe 


Tyr 


Glu 


Glu 


Ala 


Gin 
305 


Lys 


He 


Val 


Arg 


Ala 


Ser 


Ser 


Cys 
320 


Lys 


Leu 


Ser 


Gin 


Gly 


Pro 


Val 


Ser 
335 


Asn 


Ser 


Pro 


Leu 


Ser 


Tyr 


Leu 


Phe 
350 


Tyr 


Asn 


Trp 


Thr 


Val 


Trp 


Leu 


He 
365 


Trp 


Phe 


Thr 


Phe 


Ser 


Leu 


Asn 


Ser 
380 


Val 


Asn 


Leu 


Leu 


Phe 


Leu 


Leu 


Gly 
395 


Val 


Val 


Glu 


Cys 


He 


Ala 


Met 


Asp 
410 


Lys 


Val 


Gly 


Ser 


Leu 


Phe 


Cys 


Ser 
425 


Ala 


Leu 


Ala 


Pro 


Gin 


Lys 


His 


Tyr 
440 


He 


Leu 


Gly 


Lys 


Phe 


Ala 


He 


Gly 
455 


Ala 


Ala 


Phe 


Ala 


Glu 


Leu 


Tyr 


Pro 
470 


Thr 


He 


Val 


Gly 


Ser 


Met 


Val 


Cys 
485 


Arg 


Leu 


Ala 


Val 


Asp 


Leu 


Ser 


Ser 
500 


He 


Trp 


He 


Gly 


Thr 


Met 


Ala 


Leu 
515 


Leu 


Ser 


Gly 


Glu 


Thr 


Leu 


Gly 


Lys 
530 


Arg 


Leu 


Ala 


Lys 


Leu 


Glu 


Ser 


Glu 
545 


Asn 


Glu 


Ser 


Thr 


Thr 


Asn 


Asn 


Ser 
560 


Gly 


Leu 


Glu 


Arg 


Asp 


Ser 


Gly 


Leu 
575 


Gly 


Glu 
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Asp 


Tyr 


Tyr 


Thr 


Phe 


Met 


Ala 
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210 


Ala 


Ser 


Gly Tyr 


Leu 


Val 


Val 




220 










225 


He Gly 


Met 


Lys 
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Arg 


Thr 




235 










240 


Phe 


Phe 


Ala 


Val 


Gly Thr 


Leu 




250 










255 


Val 


Arg 


Thr Trp Trp Leu 


Tyr 




265 










270 


Val 


Pro 


Phe 


He 


Leu 


Cys 


Cys 




280 










285 


Trp 


Leu 


Leu 


Ser Glu Gly 


Arg 




295 










300 


Asp 


He 


Met 


Ala 


Lys 


Trp 


Asn 
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315 


Glu 


Leu 


Leu 


Ser 


Leu 


Asp 


Leu 




325 










330 


Thr 


Glu 


Val 


Gin 


Lys 


His 


Asn 




340 
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Ser 
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Thr 
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Arg 


Thr 


Leu 




355 
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Phe 


Tyr 


Ser 




370 
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Gly Gly 


Asn 


Glu 


Tyr 


Leu 


Asn 
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He 


Pro 


Ala 


Tyr 


Thr 


Phe 


Val 




400 










405 


Arg 


Arg 


Thr 


Val 


Leu 


Ala 


Tyr 




415 










420 


Cys 


Gly 


Val 


Val 


Met 


Val 


He 




430 










435 


Val 


Val 


Thr 


Ala 


Met 


Val 


Gly 




445 










450 


Gly Leu 


lie 


Tyr 


Leu 


Tyr 


Thr 




460 










465 


Arg 
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Leu 


Ala 


Val 


Gly 


Ser 




475 










480 


Ser 


He 


Leu 


Ala 


Pro 


Phe 


Ser 




490 










495 


Phe 


He 


Pro 


Gin 


Leu 


Phe 


Val 




505 










510 


Val 


Leu 


Thr 


Leu 


Lys 


Leu 


Pro 




520 










525 


Thr 


Thr 


Trp 


Glu 


Glu 


Ala 


Ala 




535 










540 


Lys 


Ser 


Ser 


Lys 


Leu 


Leu 


Leu 




550 










555 


Lys 


Thr 


Glu 


Ala 


He 


Thr 


Pro 




565 










570 
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Met 


Gly Tyr 


Gin 


Arg 


1 








5 


Leu 


Asp 


Asp 


Arg 


Glu 
20 


Lys 


Thr 


Cys 


Gin 


Ser 
35 


He 


Gly 


Ser 


Gly 


He 
50 


Gly 


Phe 


Pro 


Leu 


Gly 
65 


Thr 


Asp 


Phe 


Ser 


Leu 
80 


Gly 


Thr 


Asp 


Thr 


Tyr 
95 


Pro 


Gly Tyr 


Leu 


Leu 










110 


He 


Ala 


Met 


He 


Ser 
125 


Lys 


Val 


Phe 


Gin 


Arg 
140 


He 


Gly Arg 


His 


Phe 










155 


Leu 


Pro 


Leu 


Ser 


Leu 
170 


Ser 


Leu 


He 


Ser 


Thr 
185 


Met 


Ala 


Arg 


Ala 


He 
200 


Asp 


Ala 


Trp 


Val 


Phe 
215 


Val 


Met 


Ser 


Phe 


Ala 
230 


Tyr 


Ser 


Ser 


Leu 


Glu 
245 


He 


His 


Met 


Ser 


He 
260 


Ala 


Thr 


Cys 


Gly Tyr 










275 


Leu 


Phe 


Glu 


Asn 


Tyr 
290 


Arg 


Phe 


Cys 


Tyr 


Gly 
305 


Cys 


Phe 


Val 


Thr 


Arg 
320 


Asn 


Leu 


Ser 


Ser 


Val 
335 


Thr 


Val 


Ala 


Thr 


Leu 
350 


Val 


Leu 


Glu 


Leu 


Asn 
365 


He 


He 


Pro 


Ser 


Ala 
380 


Thr 


His 


Ser 


Asp 


Lys 
395 


Ala 


Val 


Val 


Met 


Val 
410 


Gin 


Asp 


Cys 


Thr 


His 
425 


Asn 


Phe 


Ser 


Leu 


Thr 
440 


Thr 


Gin 


Leu 


Ser 


Thr 



455 



Gin 


Glu 


Pro 


Val 


He 
10 


Thr 


Leu 


Val 


Ser 


Glu 
25 


Ala 


Ala 


Leu 


Phe 


Asn 
40 


lie Gly 


Leu 


Pro 


Tyr 










55 


He 


Leu 


Leu 


Leu 


Phe 
70 


Val 


Leu 


Leu 


He 


Lys 
85 


Gin 


Ser 


Leu 


Val 


Asn 
100 


Leu 


Ser 


Val 


Leu 


Gin 
115 


Tyr 


Asn 


He 


He 


Ala 
130 


He 


Pro 


Gly 


Val 


Asp 
145 


He 


He 


Gly 


Leu 


Ser 
160 


Tyr 


Arg 


Asn 


He 


Ala 
175 


Gly Leu 


Thr 


Thr 


Leu 










190 


Ser 


Leu 


Gly 


Pro 


His 
205 


Ala 


Lys 


Pro 


Asn 


Ala 

220 


Phe 


He 


Cys 


His 


His 
235 


Glu 


Pro 


Thr 


Val 


Ala 
250 


Val 


He 


Ser 


Val 


Phe 
265 


Leu 


Thr 


Phe 


Thr 


Gly 
280 


Cys 


Arg 


Asn 


Asp 


Asp 
295 


val 


Thr 


Val 


He 


Leu 
310 


Glu 


Val 


He 


Ala 


Asn 
325 


Phe 


His 


He 


Val 


Val 
340 


Val 


Ser 


Leu 


Leu 


He 
355 


Gly Val 


Leu 


Cys 


Ala 










370 


Cys 


Tyr 


Leu 


Lys 


Leu 
385 


He 


Met 


Ser 


Cys 


Val 
400 


Phe Gly 


Phe 


Val 


Met 










415 


Gly Gin 


Glu 


Met 


Phe 










430 


Asn 


Thr 


Ser 


Glu 


Ser 
445 


Leu 


Asn 


He 


Ser 


He 



460 
12/83 



Pro 


Pro 


Gin 


Arg 


Asp 
15 


His 


Glu 


Tyr 


Lys 


Glu 
30 


Val 


Val 


Asn 


Ser 


He 
45 


Ser 


Met 


Lys 


Gin 


Ala 
60 


Trp 


Val 


Ser 


Tyr 


Val 
75 


Gly 


Gly 


Ala 


Leu 


Ser 
90 


Lys 


Thr 


Phe 


Gly 


Phe 
105 


Phe 


Leu 


Tyr 


Pro 


Phe 
120 


Gly 


Asp 


Thr 


Leu 


Ser 
135 


Pro 


Glu 


Asn 


Val 


Phe 
150 


Thr 


Val 


Thr 


Phe 


Thr 
165 


Lys 


Leu 


Gly 


Lys 


Val 
180 


He 


Leu 


Gly 


He 


Val 
195 


lie 


Pro 


Lys 


Thr 


Glu 
210 


He 


Gin 


Ala 


Val 


Gly 
225 


Asn 


Ser 


Phe 


Leu 


Val 
240 


Lys 


Trp 


Ser 


Arg 


Leu 
255 


He 


Cys 


He 


Phe 


Phe 
270 


Phe 


Thr 


Gin 


Gly 


Asp 
285 


Leu 


Val 


Thr 


Phe 


Gly 
300 


Thr 


Tyr 


Pro 


Met 


Glu 
315 


Val 


Phe 


Phe 


Gly 


Gly 
330 


Thr 


Val 


Met 


Val 


He 
345 


Asp 


Cys 


Leu 


Gly 


He 
360 


Thr 


Pro 


Leu 


He 


Phe 
375 


Ser 


Glu 


Glu 


Pro 


Arg 
390 


Met 


Leu 


Pro 


He 


Gly 
405 


Ala 


He 


Thr 


Asn 


Thr 
420 


Tyr 


Cys 


Phe 


Pro 


Asp 
435 


His 


Val 


Gin 


Gin 


Thr 
450 


Phe 


Gin 









<WO 02464 1 SA2_I A> 



<210> 7 
<211> 1018 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7484823CD1 

<400> 7 



Met 


Gly 


Glu Gly Asp Ser Gly Leu Arg 


Ser 


Pro 


Arg 


Gly 


Pro 


Gin 


1 




5 


10 










15 


Glu 


Glv 


Gly Lys Cys Ser Arg Glu Lys 


Gin 


Lys 


Arg 


Asn 


Met 


Glu 






20 


25 










30 


Glu 


Leu 


Lys Lys Glu Val Val Met Asp 


Asp 


His 


Lys 


Leu 


Thr 


Leu 






35 


40 










45 


Glu 


Glu 


Leu Ser Thr Lys Tyr Ser Val 


Asp 


Leu 


Thr 


Lys 


Gly 


His 






50 


55 










60 


Ser 


His 


Gin Arg Ala Lys Glu He Leu 


Thr 


Arg 


Gly 


Gly 


Pro 


Asn 






65 


70 










75 


Thr 


Val 


Thr Pro Pro Pro Thr Thr Pro 


Glu 


Trp 


Val 


LVS 


Phe 


Cvs 






80 


85 










90 


Lys 


Gin 


Leu Phe Gly Gly Phe Ser Leu 


Leu 


Leu 


Trp 


Thr 


Gly 


Ala 






95 


100 










105 


lie 


Leu 


Cys Phe Val Ala Tyr Ser He 


Gin 


lie 


Tvr 


Phe 


Asn 


Glu 






110 


115 










120 


Glu 


Pro 


Thr Lys Asp Asn Leu Tyr Leu 


Ser 


lie 


Val 


Leu 


Ser 


Val 






125 


130 










135 


Val 


Val 


He Val Thr Gly Cys Phe Ser 


Tyr 


Tyr 


Gin 


Glu 


Ala 


Lys 






140 


145 










150 


Ser 


Ser 


Lys lie Met Glu Ser Phe Lys 


Asn 


Met 


Val 


Pro 


Gin 


Gin 






155 


160 










165 


Ala 


Leu 


Val He Arg Gly Gly Glu Lys 


Met 


Gin 


He 


Asn 


Val 


Gin 






170 


175 










180 


Glu 


Val 


Val Leu Gly Asp Leu Val Glu 


He 


Lys 


Gly 


Gly 


Asp 


Arg 






185 


190 










195 


Val 


Pro 


Ala Asp Leu Arg Leu He Ser 


Ala 


Gin 


Gly 


Cys 


Lys 


Val 






200 


205 










210 


Asp 


Asn 


Ser Ser Leu Thr Gly Glu Ser 


Glu 


Pro 


Gin 


Ser 


Arg 


Ser 






215 


220 










225 


Pro 


Asp 


Phe Thr His Glu Asn Pro Leu 


Glu 


Thr 


Arg 


Asn 


He 


Cys 






230 


235 










240 


Phe 


Phe 


Ser Thr Asn Cys Val Glu Gly 


Thr 


Ala 


Arg 


Gly 


He 


Val 






245 


250 










255 


He 


Ala 


Thr Gly Asp Ser Thr Val Met 


Gly 


Arg 


He 


Ala 


Ser 


Leu 






260 


265 










270 


Thr 


Ser 


Gly Leu Ala Val Gly Gin Thr 


Pro 


He 


Ala 


Ala 


Glu 


He 






275 


280 










285 


Glu 


His 


Phe He His Leu He Thr Val 


Val 


Ala 


Val 


Phe 


Leu 


Gly 






290 


295 










300 


Val 


Thr 


Phe Phe Ala Leu Ser Leu Leu 


Leu 


Gly 


Tyr 


Gly 


Trp 


Leu 






305 


310 










315 


Glu 


Ala 


lie lie Phe Leu lie Gly He 


He 


Val- 


Ala 


Asn 


Val 


Pro 






320 


325 










330 


Glu 


Gly 


Leu Leu Ala Thr Val Thr Val 


Cys 


Leu 


Thr 


Leu 


Thr 


Ala 






335 


340 










345 


Lys 


Arg 


Met Ala Arg Lys Asn Cys Leu 


Val 


Lys 


Asn 


Leu 


Glu 


Ala 






350 


355 










360 


Val 


Glu 


Thr Leu Gly Ser Thr Ser Thr 


He 


Cys 


Ser 


Asp 


Lys 


Thr 






365 


370 










375 


Gly 


Thr 


Leu Thr Gin Asn Arg Met Thr 


Val 


Ala 


His 


Met 


Trp 


Phe 






380 


385 










390 


Asp 


Met 


Thr Val Tyr Glu Ala Asp Thr 


Thr 


Glu 


Glu 


Gin 


Thr 


Gly 



13/83 



NSDOCID: <WO_0246415A2JA> 











395 






Lys 


Thr 


Phe 


Thr 


Lys 
410 


Ser 


Ser 


He 


Ala 


Gly Leu 


Cys 


Asn 


Arg 










425 






He 


Leu 


Pro 


He 


Ala 
440 


Lys 


Arg 


Ser 


Ala 


Leu 


Leu 


Lys 
455 


Phe 


He 


Glu 


Met 


Arg 


Glu 


Lys 
470 


Asn 


Pro 


Ser 


Thr 


Asn 


Lys 


Tyr 
485 


Gin 


Met 


Ser 


Gin 


Thr 


His 


Val 
500 


Leu 


Met 


Leu 


Glu 


Phe 


Cys 


Ser 
515 


Thr 


Phe 


Met 


Asn 


Asp 


Glu 


Met 
530 


Lys 


Glu 


Leu 


Gly 


Gly Leu 


Gly 


Glu 


Arg 










545 






Leu 


Pro 


Ser 


Ser 


Phe 


Ser 


Lys 










560 




lie 


Asn 


Phe 


Pro 


Met 
575 


Asp 


Asn 


Met 


He 


Asp 


Pro 


Pro 
590 


Arg 


Ala 


Cys 


Arg 


Ser 


Ala 


Gly 
605 


He 


Lys 


Pro 


He 


Thr 


Ala 


Lys 
620 


Ala 


He 


Glu 


Gly 


Thr 


Glu 


Thr 
635 


Ala 


Glu 


Pro 


He 


Ser 


Lys 


Val 
650 


Asp 


Ala 


His 


Gly 


Ala 


Glu 


Leu 
665 


Lys 


Asp 


He 


Leu 


Gin 


Asn 


His 
680 


Pro 


Glu 


Gin 


Gin 


Lys 


Leu 


He 
695 


He 


Val 


Val 


Val 


Ala 


Val 


Thr 
710 


Gly 


Asp 


Lys 


Lys 


Ala Asp 


He 


Gly 


He 










725 






Val 


Ser 


Lys 


Gin 


Ala 
740 


Ala 


Asp 


Ala 


Ser 


He 


Val 


Thr 
755 


Gly 


Val 


Asn 


Leu 


Lys 


Lys 


Ser 
770 


He 


Met 


Glu 


He 


Thr 


Pro 


Phe 
785 


Leu 


Met 


Pro 


Leu 


Gly Thr 


He 


Thr 


He 










800 






Met 


Val 


Pro 


Ala 


He 
815 


Ser 


Leu 


He 


Met 


Lys 


Arg 


Leu 
830 


Pro 


Arg 


Asn 


His 


Arg 


Leu 


He 
845 


Gly 


Met 


Gin 


Ala 


Leu 


Ala 


Gly 
860 


Phe 


Phe 







400 




405 


Asp 


Thr 


Trp Phe Met 


Leu Ala 


Arg 






415 




420 


Ala 


Asp 


Phe Lys Ala 


Asn Gin 


Glu 






430 




435 


Ala 


Thr Thr Gly Asp 


Ala Ser 


Glu 






445 




450 


Glu 


Gin 


Ser Tyr Ser 


Ser Val 


Ala 






460 




465 


Lys 


Val 


Ala Glu Val 


Pro Phe 


Asn 






475 




480 


Ser 


He 


His Leu Arg 


Glu Asp 


Ser 






490 




495 


Met 


Lys 


Gly Ala Pro 


Glu Arg 


He 






505 




510 


Leu 


Leu Asn Gly Gin 


Glu Tyr 


Ser 






520 




525 


Ala 


Phe 


Gin Asn Ala 


Tyr Leu 


Glu 






535 




540 


Val 


Leu 


Gly Phe Cys 


Phe Leu 


Asn 






550 




555 


Gly 


Phe 


Pro Phe Asn 


Thr Asp Glu 






565 




570 


Leu 


Cys 


Phe Val Gly 


Leu He 


Ser 






580 




585 


Ala 


Val 


Pro Asp Ala 


Val Ser 


Lys 






595 




600 


Val 


He 


Met Val Thr 


Gly Asp 


His 






610 




615 


Ala 


Lys 


Gly Val Gly 


He He 


Ser 






625 




630 


Glu 


Val 


Ala Ala Arg 


Leu Lys 


He 






640 




645 


Ser 


Ala 


Ala Lys Ala 


He Val 


Val 






655 




660 


He 


Gin 


Ser Lys Gin 


Leu Asp 


Gin 






670 




675 


He 


Val 


Phe Ala Arg 


Thr Ser 


Pro 






685 




690 


Glu 


Gly Cys Gin Arg 


Leu Gly Ala 






700 




705 


Gly 


Val 


Asn Asp Ser 


Pro Ala 


Leu 






715 




720 


Ala 


Met 


Gly He Ser 


Gly Ser Asp 






730 




735 


Met 


He 


Leu Leu Asp 


Asp Asn 


Phe 






745 




750 


Glu 


Glu 


Gly Arg Leu 


He Phe 


Asp 






760 




765 


Tyr 


Thr 


Leu Thr Ser 


Asn He 


Pro 






775 




780 


Phe 


He 


He Leu Gly 


He Pro 


Leu 






790 




795 


Leu 


Cys 


He Asp Leu 


Gly Thr 


Asp 






805 




810 


Ala 


Tyr 


Glu Ser Ala 


Glu Ser 


Asp 






820 




825 


Asn 


Pro 


Lys Thr Asp 


Asn Leu 


Val 






835 




840 


Ala 


Tyr 


Gly Gin He 


Gly Met 


He 






850 




855 


Thr 


Tyr 


Phe Val He 


Leu Ala 


Glu 






865 




870 



14/83 



NSOOCia <WO 024641 5A2_1A> 



Asn 


Gly 


Phe 


Arg 


Pro 
875 


Val 


Asp 


Leu 


Glu 


Asp 


Lys 


Tyr 


Leu 
890 


Asn 


Asp 


Leu 


Trp 


Thr 


Tyr 


Glu 


Gin 
905 


Arg 


Lys 


Val 


Ala 


Phe 


Phe 


Val 


Thr 
920 


He 


Val 


Val 


He 


Ser 


Lys 


Thr 


Arg 
935 


Arg 


Asn 


Ser 


Asn 


Lys 


Val 


Leu 


He 
950 


Phe 


Gly 


He 


Ala 


Phe 


Leu 


Ser 


Tyr 
965 


Thr 


Pro 


Gly 


Tyr 


Pro 


Leu 


Lys 


He 
980 


Thr 


Trp 


Trp 


He 


Leu 


He 


Phe 


Val 
995 


Tyr 


Asp 


Glu 


Gin 


His 


Pro 


Asp 


Cly 


Trp 


Val 


Glu 



1010 

<210> 8 
<211> 313 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 143935CD1 

<400> 8 



Met 


Glu 


Glu Met 


Ser 


Gly 


Glu 


Ser 


1 






5 








Ala 


Ala 


Ala Thr 


Arg 
20 


Thr 


Thr 


Ser 


Ser 


Lys 


Tyr Val 


Lys 
35 


Leu 


Asn 


Val 


Thr 


Met 


Gin Thr 


Leu 
50 


Thr 


Lys 


Gin 


Phe 


Ser 


Gly Arg 


Met 
65 


Glu 


Val 


Leu 


Leu 


He 


Asp Arg 


Cys 
80 


Gly 


Lys 


His 


Leu 


Arg 


Asp Gly 


Ala 
95 


Val 


Pro 


Leu 


Glu 


Glu 


Leu Leu 


Ala 
110 


Glu 


Ala 


Lys 


Val 


Glu 


Glu Cys 


Gin 
125 


Ala 


Ala 


Leu 


Pro 


Phe 


Cys Lys 


Val 
140 


Pro 


Val 


He 


Lys 


Leu 


He Ala 


Thr 
155 


Ser 


Asn 


Lys 


Asn 


Arg 


Ser Asn 


Asn 
170 


Lys 


Tyr 


Ser 


Asn 


Met 


Leu Lys 


Asn 
185 


He 


Glu 


Leu 


Phe 


Asn 


Gly Arg 


Val 
200 


Leu 


Phe 


He 


He 


Cys 


Cys Trp 


Ser 
215 


Phe 


Tyr 


Gly 


Val 


Cys 


Cys Thr 


Ser 


lie 


Val 


Tyr 



230 



Leu 


Gly 


lie Arg 


Leu His 


Trp 




880 








885 


Glu 


Asp 


Ser Tyr 


Gly Gin 


Gin 




895 








900 


Val 


Glu 


Phe Thr 


Cys Gin 


Thr 




910 








915 


Val 


Gin 


Trp Ala 


Asp Leu 


lie 




925 








930 


Leu 


Phe 


Gin Gin 


Gly Met 


Arg 




940 








945 


Leu 


Glu 


Glu Thr 


Leu Leu 


Ala 




955 








960 


Met 


Asp 


Val 


Ala 


Leu Arg 


Met 




970 








975 


Leu 


Cys 


Ala 


lie 


Pro Tyr 


Ser 




985 








990 


lie 


Arg 


Lys 


Leu 


Leu He 


Arg 


1000 






1005 


Arg 


Glu 


Thr 


Tyr 


Tyr 




1015 










Val 


Val 


Ser 


Ser 


Ala Val 


Pro 




10 








15 


Phe 


Lys 


Gly Thr 


Ser Pro 


Ser 




25 








30 


Gly 


Gly 


Ala 


Leu 


Tyr Tyr 


Thr 




40 








45 


Asp 


Thr 


Met 


Leu 


Lys Ala 


Met 




55 








60 


Thr 


Asp 


Ser 


Glu 


Gly Trp 


He 




70 








75 


Phe 


Gly 


Thr 


lie 


Leu Asn 


Tyr 




85 








90 


Pro 


Glu 


Ser Arg 


Arg Glu 


He 




100 








105 


Tyr 


Tyr 


Leu Val 


Gin Gly Leu 




115 








120 


Gin 


Asn 


Lys Asp 


Thr Tyr 


Glu 




130 








135 


Thr 


Ser 


Ser 


Lys 


Glu Glu 


Gin 




145 








150 


Pro 


Ala 


Val 


Lys 


Leu Leu 


Tyr 




160 








165 


Tyr 


Thr 


Ser 


Asn 


Ser Asp 


Asp 




175 








180 


Phe 


Asp 


Lys 


Leu 


Ser Leu 


Arg 




190 








195 


Lys 


Asp 


Val 


lie 


Gly Asp Glu 




205 








210 


Gin 


Gly 


Arg Lys 


lie Ala 


Glu 




220 








225 


Ala 


Thr 


Glu 


Lys 


Lys Gin 


Thr 




235 








240 



15/83 



.5DOCID: <WO 024641 5A2JA> 



Lys 


Val 


Glu 


Phe 


Pro 


Glu Ala Arg lie Tyr 


Glu 


Glu 


Thr Leu 


Asn 










245 


250 








255 


lie 


Leu 


Leu 


Tyr 


Glu 


Ala Gin Asp Gly Arg 


Gly 


Pro 


Asp Asn 


Ala 










260 


265 








270 


Leu 


Leu 


Glu 


Ala 


Thr 


Gly Gly Ala Ala Gly 


Arg 


Ser 


His His 


Leu 










275 


280 








285 


Asp 


Glu 


Asp 


Glu 


Glu 


Arg Glu Arg lie Glu 


Arg 


Val 


Arg Arg 


lie 










290 


295 








300 


His 


lie 


Lys 


Arg 


Pro 


Asp Asp Arg Ala His 


Leu 


His 


Gin 












305 


310 











<210> 9 

<211> 921 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5923789CD1 

<400> 9 



Met 


Ala 


Trp 


Leu 


Arg 


Leu 


Gin Pro 


Leu 


Thr 


Ser 


Ala Phe Leu 


His 


1 








5 








10 






15 


Phe 


Gly 


Leu 


Val 


Thr 


Phe 


Val Leu 


Phe 


Leu 


Asn 


Gly Leu Arg 


Ala 










20 








25 






30 


Glu 


Ala 


Gly 


Gly 


Ser 


Gly 


Asp Val 


Pro 


Ser 


Thr 


Gly Gin Asn 


Asn 










35 








40 






45 


Glu 


Ser 


Cys 


Ser 


Gly 


Ser 


Ser Asp 


Cys 


Lys 


Glu Gly Val He 


Leu 










50 








55 






60 


Pro 


lie 


Trp 


Tyr 


Pro 


Glu 


Asn Pro 


Ser 


Leu 


Gly Asp Lys He 


Ala 










65 








70 






75 


Arg 


Val 


He 


Val 


Tyr 


Phe 


Val Ala 


Leu 


He 


Tyr 


Met Phe Leu 


Gly 










80 








85 






90 


Val 


Ser 


He 


He 


Ala 


Asp 


Arg Phe 


Met 


Ala 


Ser 


He Glu Val 


He 










95 








100 






105 


Thr 


Ser 


Gin 


Glu 


Arg 


Glu 


Val Thr 


He 


Lys 


Lys 


Pro Asn Gly 


Glu 










110 








115 






120 


Thr 


Ser 


Thr 


Thr 


Thr 


He 


Arg Val 


Trp Asn 


Glu 


Thr Val Ser 


Asn 










125 








130 






135 


Leu 


Thr 


Leu 


Met 


Ala 


Leu 


Gly Ser 


Ser 


Ala 


Pro 


Glu He Leu 


Leu 










140 








145 






150 


Ser 


Leu 


He 


Glu 


Val 


Cys 


Gly His Gly Phe 


He Ala Gly Asp 


Leu 










155 








160 






165 


Gly 


Pro 


Ser 


Thr 


He 


Val 


Gly Ser Ala Ala 


Phe 


Asn Met Phe 


He 










170 








175 






180 


He 


lie 


Gly 


He 


Cys 


Val 


Tyr Val 


He 


Pro 


Asp 


Gly Glu Thr 


Arg 










185 








190 






195 


Lys 


He 


Lys 


His 


Leu 


Arg 


Val Phe 


Phe 


He 


Thr 


Ala Ala Trp 


Ser 










200 








205 






210 


He 


Phe 


Ala 


Tyr 


He 


Trp 


Leu Tyr 


Met 


He 


Leu 


Ala Val Phe 


Ser 










215 








220 






225 


Pro 


Gly 


Val 


Val 


Gin 


Val 


Trp Glu Gly Leu 


Leu 


Thr Leu Phe 


Phe 










230 








235 






240 


Phe 


Pro 


Val 


Cys 


Val 


Leu 


Leu Ala Trp Val 


Ala 


Asp Lys Arg 


Leu 










245 








250 






255 


Leu 


Phe 


Tyr 


Lys 


Tyr 


Met 


His Lys 


Lys 


Tyr 


Arg 


Thr Asp Lys 


His 










260 








265 






270 


Arg 


Gly 


He 


He 


He 


Glu 


Thr Glu Gly Asp 


His 


Pro Lys Gly 


He 










275 








280 






285 


Glu 


Met 


Asp 


Gly 


Lys 


Met 


Met Asn 


Ser 


His 


Phe 


Leu Asp Gly 


Asn 










290 








295 






300 


Leu 


Val 


Pro 


Leu 


Glu 


Gly 


Lys Glu 


Val 


Asp 


Glu 


Ser Arg Arg 


Glu 










305 








310 




315 



16/83 



SNSDOCID: <WO 024641 5A2_IA> 



Met 


He 


Arg He Leu 


Lys Asp Leu Lys Gin 


Lys 


His 


Pro 


Glu 


Lys 






320 






325 










330 


Asp 


Leu 


Asp Gin Leu 


Val- 


Glu Met Ala 


Asn 


Tyr 


Tyr 


Ala 


Leu 


Ser 






335 






340 










345 


His 


Gin 


Gin Lys Ser 


Arg 


Ala Phe Tyr 


Arg 


lie 


Gin 


Ala 


Thr 


Arg 






350 






355 










360 


Met 


Met 


Thr Gly Ala 


Gly 


Asn lie Leu 


Lys 


Lys 


His 


Ala 


Ala 


Glu 






365 






370 










375 


Gin 


Ala 


Lys Lys Ala 


Ser 


Ser Met Ser 


Glu 


Val 


His 


Thr 


Asp 


Glu 






380 






385 










390 


Pro 


Glu 


Asp Phe He 


Ser 


Lys Val Phe 


Phe 


Asp 


Pro 


Cys 


Ser 


Tyr 






395 






400 










405 


Gin 


Cys 


Leu Glu Asn 


Cys 


Gly Ala Val 


Leu 


Leu 


Thr 


Val 


Val 


Arg 






410 






415 










420 


Lys 


Gly 


Gly Asp Met 


Ser 


Lys Thr Met 


Tyr 


Val 


Asp 


Tyr 


Lys 


Thr 






425 






430 










435 


Glu 


Asp 


Gly Ser Ala 


Asn 


Ala Gly Ala Asp 


Tyr 


Glu 


Phe 


Thr 


Glu 






440 






445 










450 


Gly 


Thr 


Val Val Leu 


Lys 


Pro Gly Glu 


Thr 


Gin 


Lys 


Glu 


Phe 


Ser 






455 






460 










465 


Val 


Gly 


He lie Asp 


Asp 


Asp lie Phe 


Glu 


Glu 


Asp 


Glu 


His 


Phe 






470 






475 










480 


Phe 


Val 


Arg Leu Ser 


Asn 


Val Arg He 


Glu 


Glu 


Glu 


Gin 


Pro 


Glu 






485 






490 










495 


Glu 


Gly 


Met Pro Pro 


Ala 


lie Phe Asn 


Ser 


Leu 


Pro 


Leu 


Pro 


Arg 






500 






505 










510 


Ala 


Val 


Leu Ala Ser 


Pro 


Cys Val Ala 


Thr 


Val 


Thr 


He 


Leu 


Asp 






515 






520 










525 


Asp 


Asp 


His Ala Gly 


lie 


Phe Thr Phe 


Glu 


Cys 


Asp 


Thr 


He 


His 






530 






535 










540 


Val 


Ser 


Glu Ser lie 


Gly 


Val Met Glu 


Val 


Lys 


Val 


Leu 


Arg 


Thr 






545 






550 










555 


Ser 


Gly 


Ala Arg Gly 


Thr 


Val lie Val 


Pro 


Phe 


Arg 


Thr 


Val 


Glu 






560 






565 










570 


Gly 


Thr 


Ala Lys Gly 


Gly 


Gly Glu Asp Phe 


Glu 


Asp 


Thr 


Tyr 


Gly 






575 






580 










585 


Glu 


Leu 


Glu Phe Lys 


Asn 


Asp Glu Thr Val 


Lys 


Thr 


He 


Arg 


Val 






590 






595 










600 


Lys 


He 


Val Asp Glu 


Glu 


Glu Tyr Glu Arg 


Gin 


Glu 


Asn 


Phe 


Phe 






605 






610 










615 


He 


Ala 


Leu Gly Glu 


Pro 


Lys Trp Met 


Glu 


Arg 


Gly 


He 


Ser 


Asp 






620 






625 










630 


Val 


Thr 


Asp Arg Lys 


Leu 


Thr Met Glu 


Glu 


Glu 


Glu 


Ala 


Lys 


Arg 






635 






640 










645 


He 


Ala 


Glu Met Gly 


Lys 


Pro Val Leu Gly 


Glu 


His 


Pro 


Lys 


Leu 






650 






655 










660 


Glu 


Val 


lie lie Glu 


Glu 


Ser Tyr Glu 


Phe 


Lys 


Thr 


Thr 


Val 


Asp 






665 






670 










675 


Lys 


Leu 


lie Lys Lys 


Thr 


Asn Leu Ala 


Leu 


Val 


Val 


Gly 


Thr 


His 






680 






685 










690 


Ser 


Trp 


Arg Asp Gin 


Phe 


Met Glu Ala 


He 


Thr 


Val 


Ser 


Ala 


Ala 






695 






700 










705 


Gly 


Asp 


Glu Asp Glu 


Asp Glu Ser Gly Glu 


Glu 


Arg 


Leu 


Pro 


Ser 






710 






715 










720 


Cys 


Phe 


Asp Tyr Val 


Met 


His Phe Leu 


Thr 


Val 


Phe 


Trp 


Lys 


Val 






725 






730 










735 


Leu 


Phe 


Ala Cys Val 


Pro 


Pro Thr Glu 


Tyr 


Cys 


His 


Gly 


Trp 


Ala 






740 






745 










750 


Cys 


Phe 


Ala Val Ser 


lie 


Leu lie lie Gly 


Met 


Leu 


Thr 


Ala 


He 






755 






760 










765 


He 


Gly 


Asp Leu Ala 


Ser His Phe Gly Cys 


Thr 


lie 


Gly 


Leu 


Lys 






770 






775 










780 


Asp 


Ser 


Val Thr Ala 


Val 


Val Phe Val 


Ala 


Phe 


Gly 


Thr 


Ser 


Val 
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785 




Pro 


Asp 


Thr 


Phe 


Ala Ser Lys 
800 


Ala 


Ala 


Asp 


Ala 


Ser 


lie Gly Asn 
815 


Val 


Val 


Phe 


Leu 


Gly 


lie Gly Leu 
830 


Ala 


Trp 


Ala 


Leu 


Gin 


Gly Gin Glu 
845 


Phe 


Ala 


Phe 


Ser 


Val 


Thr Leu Phe 
860 


Thr 


Ser 


Val 


Leu 


Leu 


Tyr Arg Arg 
875 


Arg 


Gly 


Gly 


Pro 


Arg 


Gly Cys Lys 
890 


Leu 


Ser 


Leu 


Trp 


Leu 


Leu Tyr lie 
905 


Leu 


Cys 


Tyr 


lie 


Lys 


Gly Phe 
920 





<210> 10 

<211> 1466 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 6046484CD1 

<400> 10 



Met 


Tyr 


He 


Arg 


Val 


Ser Tyr 


Asp 


1 








S 




His 


Leu 


Met 


Val 


Lys Asp Trp 


Gin 










20 






lie 


Ser 


Val 


His Gly Gly Leu 


Gin 










35 






Leu 


Lys 


Gin 


Val 


Phe 


Gly Lys 


Gly 










50 






Thr 


Gly 


Ala 


Trp 


lie 


Phe Thr 


Gly 










65 




Ser 


His 


Val 


Gly Asp Ala Leu 


Lys 










80 




Gly 


Arg 


Val 


Cys 


Ala 


He Gly 


He 










95 






Asn 


Lys 


Glu 


Asp 


Leu 


Val Gly 


Lys 










110 




Thr 


Met 


Ser 


Asn 


Pro 


Leu Ser 


Lys 










125 




His 


Thr 


His 


Phe 


He 


Leu Ala 


Asp 










140 




Gly 


Ala 


Glu 


Val 


Lys 


Leu Arg 


Arg 










155 




Leu 


Gin 


Lys 


He 


Asn 


Thr Arg 


He 










170 






Ala 


Leu 


He 


Val 


Glu 


Gly Gly 


Pro 










185 






Glu 


Tyr 


Leu 


Arg 


Asp 


Thr Pro 


Pro 










200 






Gly 


Ser 


Gly 


Arg 


Ala 


Ser Asp 


He 










215 






Ser 


Glu 


Glu 


Gly Gly Leu He 


Asn 










230 






Leu 


Val 


Thr 


He 


Gin 


Lys Thr 


Phe 





790 










795 


Ala 


Ala 


Leu 


Gin 


Asp 


Val 


Tvr 




805 










810 


Thr 


Gly 


Ser 


Asn 


Ala 


Val 


Asn 




820 










825 


Trp 


Ser 


Val 


Ala 


Ala 


He 


Tvr 




835 










840 


His 


Val 


Ser 


Ala 


Gly Thr 


Leu 




850 










855 


He 


Phe 


Ala 


Phe 


Val 


Cys 


He 




865 










870 


Pro 


His 


Leu 


Gly 


Gly 


Glu 


Leu 




880 










885 


Ala 


Thr 


Thr 


Trp 


Leu 


Phe 


Val 




895 










900 


Phe 


Ala 


Thr 


Leu 


Glu 


Ala 


Tyr 




910 










915 



Thr Lys 


Pro Asp 


Ser 


Leu 


Leu 


10 








15 


Leu Glu 


Leu Pro 


Lys 


Leu 


Leu 


25 








30 


Asn Phe 


Glu Met 


Gin 


Pro 


Lys 


40 








45 


Leu lie 


Lys Ala 


Ala 


Met 


Thr 


55 








60 


Gly Val 


Ser Thr 


Gly 


Val 


He 


70 








75 


Asp His 


Ser Ser 


Lys 


Ser 


Arg 


85 








90 


Ala Pro 


Trp Gly 


lie 


Val 


Glu 


100 








105 


Asp Val 


Thr Arg Val 


Tyr 


Gin 


115 








120 


Leu Ser 


Val Leu 


Asn 


Asn 


Ser 


130 








135 


Asn Gly Thr Leu 


Gly Lys 


Tyr 


145 








150 


Gin Leu 


Glu Lys 


His 


lie 


Ser 


160 








165 


Gly Gin Gly Val 


Pro 


Val 


Val 


175 








180 


Asn Val 


He Ser 


lie 


Val 


Leu 


190 








195 


Val Pro 


Val Val 


Val 


Cys 


Asp 


205 








210 


Leu Ala 


Phe Gly His 


Lys 


Tyr 


220 








225 


Glu Ser 


Leu Arg Asp Gin 


Leu 


235 








240 


Thr Tyr 


Thr Arg 


Thr 


Gin 


Ala 
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245 










250 








255 


Gin 


His 


Leu Phe 


He 
260 


He 


Leu 


Met 


Glu 


Cys 
265 


Met 


Lys Lys 


Lys 


Glu 
270 


Leu 


lie 


Thr Val 


Phe 
275 


Arg 


Met 


Gly 


Ser 


Glu 
280 


Gly 


His Gin 


Asp 


He 
285 


Asp 


Leu 


Ala He 


Leu 
290 


Thr 


Ala 


Leu 


Leu 


Lys 
295 


Gly 


Ala Asn 


Ala 


Ser 
300 


Ala 


Pro 


Asp Gin 


Leu 
305 


Ser 


Leu 


Ala 


Leu 


Ala 
310 


Tro 


Asn Arg 


Val 


ASD 

315 


He 


Ala 


Arg Ser 


Gin 
320 


He 


Phe 


He 


Tvr 


Gly 
325 


Gin 


Gin Trp 


Pro 


Val 
330 


Glv 


Ser 


Leu Glu 


Gin 
335 


Ala 


Met 


Leu 


Asd 


Ala 
340 


Leu 


Val Leu 


Asp 


Arg 
345 


Val 


Asd 


Phe Val 


Lys 
350 


Leu 


Leu 


He 


Glu 


Asn 
355 


Glv 


Val Ser 


Met 


His 
360 


Ara 


Phe 


Leu Thr 


He 
365 


Ser 


Arg 


Leu 


Glu 


Glu 
370 


Leu 


Tyr Asn 


Thr 


Arg 
375 


His 


Glv 


Pro Ser 


Asn 
380 


Thr 


Leu 


Tvr 

A Jr 


His 


Leu 
385 


Val 


Ara Asd 


Val 


Lys 
390 


Lys 


Glv 


Asn Leu 


Pro 
395 


Pro 


Asp 


Tvr 


Arg 


He 
400 


Ser 


Leu He 


Asp 


He 
405 


Glv 


Leu 


Val He 


Glu 
410 


Tvr 


Leu 


Met 


Glv 


Glv 
415 


Ala 


Tyr Arg 


Cys 


Asn 
420 


Tvr 


Thr 


Arg Lys 


Ara 


Phe 


Arg 


Thr 


Leu 


Tvr 

Jr 


His 


Asn Leu 


Phe Gly 








425 










430 








435 


Pro 


Lys 


Arg Asp 


Asp 
440 


He 


Pro 


Leu 


Ara 


Ara 
445 


Glv 

UA Jr 


Arg Lys 


Thr 


Thr 
450 


Lys 


Lys 


Arg Glu 


Glu 
455 


Glu 


Val 


Asp 


He 


Asp 
460 


Leu 


Asp Asp 


Pro 


Glu 
465 


He 


Asn 


His Phe 


Pro 
470 


Phe 


Pro 


Phe 


His 


Glu 
475 


Leu 


Met Val 


Trp 


Ala 
480 


Val 


Leu 


Met Lvs 


Arg 
485 


Gin 


Lys 


Met 


Ala 


Leu 
490 


Phe 


Phe Trp 


Gin 


His 
495 


Glv 


Glu 


Glu Ala 

vj^\j nit* 


Met 
500 


Ala 


Lys 


Ala 


Leu 


Val 
505 


Ala 


Cys Lys 


Leu 


Cys 
510 


Lys 


Ala 


Met Ala 


His 


Glu 


Ala 


Ser 


Glu 


Asn 


Asp 


Met Val 


Asp Asp 








515 










520 








525 


He 


Ser 


Gin Glu 


Leu 
530 


Asn 


His 


Asn 


Ser 


Arg 
535 


Asp 


Phe Gly 


Gin 


Leu 
540 


Ala 


Val 


Glu Leu 


Leu 
545 


Asp 


Gin 


Ser 


Tvr 
j 


Lys 
550 


Gin 


A<?n Glu 


Gin 


Leu 
555 


Ala 


Met 


Lvs Leu 


Leu 
560 


Thr 


Tvr 


Glu 


Leu 


Lys 
565 


Asn 


Trp Ser 


Asn 


Ala 
570 


Thr 


Cvs 


Leu Gin 


Leu 
575 


Ala 


Val 


Ala 


Ala 


Lys 
580 


His 


Arg Asp 


Phe 


He 
585 


Ala 


His 


Thr Cys 


Ser 
590 


Gin 


Met 


Leu 


Leu 


Thr 
595 


Asd 


Met Trp 


Met 


Gly 
600 


Arq 


Leu 


Arg Met 


Arg 


Lvs 


Asn 


Ser 


Gly 


Leu 


LVS 

•"Jr 


Val He 


Leu Gly 








605 










610 








615 


He 


Leu 


Leu Pro 


Pro 
620 


Thr 


He 


Leu 


Phe 


Leu 
625 


Glu 


Phe Arg 


Thr 


Tyr 
630 


Asp 


Asp 


Phe Ser 


Tyr 
635 


Gin 


Thr 


Ser 


Lys. 


Glu 
640 


Asn 


Glu Asp 


Gly 


Lys 
645 


Glu 


Lys 


Glu Glu 


Glu 
650 


Asn 


Thr 


Asp 


Ala 


Asn 
655 


Ala 


Asp Ala 


Gly 


Ser 
660 


Arg 


Lys 


Gly Asp 


Glu 
665 


Glu 


Asn 


Glu 


His 


Lys 
670 


Lys 


Gin Arg 


Ser 


He 
675 


Pro 


He 


Gly Thr 


Lys 
680 


He 


Cys 


Glu 


Phe 


Tyr 
685 


Asn 


Ala Pro 


He 


Val 
690 


Lys 


Phe 


Trp Phe 


Phe 
695 


Gin 


lie 


Ser 


Tyr 


Leu 
700 


Gly 


Tyr Leu 


Leu 


Leu 
705 


Phe 


Asn 


Tyr Val 


He 
710 


Leu 


Val 


Arg 


Met 


Asp 
715 


Gly 


Trp Pro 


Ser 


Leu 
720 
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Gin 


Glu 


Trp 


> He 


Val He 


Ser 


Tyr 










725 




Lys 


He 


Arg 


Glu 


Val Ala 


Thr 


Pro 










740 






Ala 


Arg 


Lys 


Ser 


Lys Phe Gly Phe 










755 






Asp 


Leu 


Val 


Ala 


He Ser 


Thr 


Phe 










770 






Leu 


Gin 


Asn 


Gin 


Pro Tyr 


Met 


Gly 










785 




Val 


Asp 


He 


He 


Phe Trp Tyr 


He 










800 






Val 


Asn 


Lys 


Tyr 


Leu Gly Pro Tyr 










815 






Val 


Ser 


Ser Gly 


He Leu 


Trp 


Val 










830 






Met 


Ser 


Phe 


Gly Val Ala 


Arg 


Gin 










845 






Lys 


Pro 


Ser 


Trp 


Lys Leu 


Ala 


Arg 










860 






Trp 


Met 


He 


Tyr 


Gly Glu 


Val 


Phe 










875 






Ser 


Phe 


Phe 


Leu 


Ser Ala 


Pro 












890 




Glu 


Gly 


Lys 


Arg 


Leu Pro 


Pro 


Cys 










905 




Pro 


Ala 


Leu 


Met 


Ala Cys 


Tyr 


Leu 










920 






Val 


Asn 


Leu 


Leu 


He Ala 


Val 


Phe 










935 






Lys 


Ser 


He 


Ser 


Asn Gin 


Val 


Tm 










950 




He 


Met 


Thr 


Phe 


His Asp Arg 


Pro 










965 






He 


Leu 


Ser 


His 


He Tyr 


He 


He 










980 






Cys 


Arg 


Lys 


Lys 


Arg Glu 


Gly 


Asp 










995 






Leu 


Ser 


Met 


Phe 


Leu Ser 


Asp 


Glu 








1010 






Phe 


Glu 


Glu 


Gin 


Cys Val 


Gin 


Glu 








1025 






Glu 


Gin 


Gin 


Ser 


Ser Ser 


Asp 


Glu 








1040 






Arg 


Val 


Glu 


Asn 


Met Ser 


Met 


Arg 








1055 




Glu 


Thr 


Phe 


Met 


Lys Thr 


Ser 


Leu 








1070 






Ala 


Gin 


Leu 


Glu 


Glu Leu. 


Ser 


Asn 








1085 






Asn 


Leu 


Ala Gly 


He Asp 


Arg 


Ser 








1100 






Arg 


Ala 


Ser 


Ser 


Glu Cys 


Glu 


Ala 








1115 






Ser 


He 


Asn 


Ser 


Ala Asp 


Gly 


Tyr 








1130 






Asn Gly 


Glu 


Glu 


Leu Leu 


Phe 


Glu 








1145 






Pro 


Gly Thr Gly Val Arg Lys Lys 








1160 






Glu 


Glu 


Lys 


Asp Val Lys 


Thr 


His 








1175 






Ser 


Leu 


His 


Leu 


Ser Leu Gly Thr 



He Val 


Ser 


Leu 


Ala 


Leu Glu 


730 








735 


Lys Ala 


Ser 


Pro 


Ser 


Pro Leu 


745 








750 


Gin Glu 


Tyr 


Trp 


Asn 


He Thr 


760 








765 


Met He Gly Ala 


He 


Leu Arg 


775 








780 


Tyr Gly Arg Val 


He 


Tyr Cys 


790 








795 


Arg Val 


Leu 


Asp 


He 


Phe Gly 


805 








810 


Val Met 


Met 


He Gly Lys Met 


820 










Val He 


Met 


Leu 


Val 


vaj. Leu 


835 








Q A O 


Ala He 


Leu 


His 


Pro 


V*-LU CaXU 


850 








ODD 


Asn He 


Phe 


Tyr 


Mot- 


rxo J>yr 


865 








O *7 rt 


Ala Asp 


Gin 


He 


Asp 


Arg Lys 


880 








QQC 

OOJ 


Gly Glu Asn 


Leu 


Tyr 


ASp blU 


895 








y UU 


He Pro Gly Ala 


Trp 


Leu Thr 


910 








y i j 


Leu Val 


Ala 


Asn 


lie 


Leu Leu 


925 








Q 1 ft 


Ser Asn 


Thr 


Phe 


rTie 


<jj-u vax 


940 










Lys Phe 


Gin 


Arg 


Tyr 


Gin Leu 


955 








you 


Val Leu 


Pro 


Pro 


tr L. \J 


TVfof* Tl a 


970 








/ D 


He Met 


Arg 


Leu 


C iTi-V- 




985 








Q q n 


Gin Glu 


Glu 


Arg 


7\ on 


niy biy 


1000 








i nn^ 
j. \j \j _> 


Glu Leu 


Lys 


Arg 


JJCU 


nib oiu 


1015 










His Phe 


Arg 


Glu 




VaJLU nSp 


1030 










Arg He Arg Val 


Thr 


c pr pi,, 

OCX ox u 


1045 








1050 


Leu Glu 


Glu 


He 


Asn 


Glu Arg 


1060 








1065 


Gin Thr 


Val 


Asp 


Leu 


Arg Leu 


1075 








1080 


Arg Met 


Val 


Asn 


Ala 


Leu Glu 


1090 








1095 


Asp Leu 


He 


Gin 


Ala 


Arg Ser 


1105 








1110 


Thr Tyr 


Leu 


Leu 


Arg 


Gin Ser 


1120 








1125 


Ser Leu 


Tyr 


Arg 


Tyr 


His Phe 


1135 








1140 


Asp Thr 


Ser 


Leu 


Ser 


Thr Ser 


1150 








1155 


Thr Cys 


Ser 


Phe 


Arg 


He Lys 


1165 








1170 


Leu Val 


Pro 


Glu 


Cys 


Gin Asn 


1180 






1185 


Ser Thr 


Ser 


Ala 


Thr 


Pro Asp 
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1190 1195 . 1200 

Gly Ser His Leu Ala Val Asp Asp Leu Lys Asn Ala Glu Glu Ser 

1205 1210 1215 

Lys Leu Gly Pro Asp lie Gly lie Ser Lys Glu Asp Asp Glu Arg 

1220 1225 1230 

Gin Thr Asp Ser Lys Lys Glu Glu Thr lie Ser Pro Ser Leu Asn 

1235 1240 1245 

Lys Thr Asp Val He His Gly Gin Asp Lys Ser Asp Val Gin Asn 

1250 1255 1260 

Thr Gin Leu Thr Val Glu Thr Thr Asn He Glu Gly Thr He Ser 

1265 1270 1275 

Tyr Pro Leu Glu Glu Thr Lys He Thr Arg Tyr Phe Pro Asp Glu 

1280 1285 1290 

Thr He Asn Ala Cys Lys Thr Met Lys Ser Arg Ser Phe Val Tyr 

1295 1300 1305 

Ser Arg Gly Arg Lys Leu Val Gly Gly Val Asn Gin Asp Val Glu 

1310 1315 1320 

Tyr Ser Ser He Thr Asp Gin Gin Leu Thr Thr Glu Trp Gin Cys 

1325 1330 1335 

Gin Val Gin Lys He Thr Arg Ser His Ser Thr Asp He Pro Tyr 

1340 1345 1350 

He Val Ser Glu Ala Ala Val Gin Ala Glu His Lys Glu Gin Phe 

1355 1360 "* 1365 

Ala Asp Met Gin Asp Glu His His Val Ala Glu Ala He Pro Arg 

1370 1375 1380 

He Pro Arg Leu Ser Leu Thr He Thr Asp Arg Asn Gly Met Glu 

1385 1390 - 1395 

Asn Leu Leu Ser Val Lys Pro Asp Gin Thr Leu Gly Phe Pro Ser 

1400 1405 1410 

Leu Arg Ser Lys Ser Leu His Gly His Pro Arg Asn Val Lys Ser 

1415 1420 1425 

He Gin Gly Lys Leu Asp Arg Ser Gly His Ala Ser Ser Val Ser 

1430 1435 1440 

Ser Leu Val He Val Ser Gly Met Thr Ala Glu Glu Lys Lys Val 

1445 1450 * 1455 
Lys Lys Glu Lys Ala Ser Thr Glu Thr Glu Cys 

1460 1465 

<210> 11 
<211> 222 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7481427CD1 

<400> 11 

Met Thr Glu Gin Ala He Ser Phe Ala Lys Asp Phe Leu Ala Gly 
1 5 10 15 

Gly He Thr Ala Ala He Ser Lys Thr Ala Val Ala Ser He Lys 

20 25 30 

Arg Val Gin Leu Leu Leu Gin Met Gin His Ala Ser Met Pro Met 

35 40 45 

Ala Ala Ala Lys Gin Cys Lys Gly He Val Asp Cys He Val Arg 

50 55 60 

He Pro Lys Asp Gin Gly Val Leu Ser Phe Trp Arg Gly Asn Leu 

65 70 75 

Ala Asn Val He Arg Tyr Ser Pro Thr Gin Ala Leu Asn Phe Ala 

80 85 90 

Phe Lys Asp Lys Tyr Lys Gin He Phe Leu Ala Gly Val Asp Lys 

95 100 "* 105 

His Thr Gin Phe Cys Arg Tyr Phe Ala Gly Asn\ Leu Ala Ser Gly 
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110 






Gly 


Thr 


Ala 


Val 


Val 
125 


Tyr 


Pro Leu 


Ala 


Ala 


Asp Val 


Gly 


Lys 


Ser Gly 










140 






Leu 


Gly 


Asp 


Cys 


Leu 
155 


Val 


Lys lie 


Gly 


Leu 


Tyr 


Gin 


Gly 
170 


Phe 


Ser Val 


Tyr 


Gin 


Ala 


Ala 


Tyr 
185 


Phe 


Arg Val 


Phe 


Pro 


Asp 


Pro 


Lys 
200 


Asn 


Thr His 


Ala 


Gin 


Thr 


Val 


Thr 
215 


Ala 


Val Ala 



<210> 12 

<211> 461 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7483595CD1 

<400> 12 



Met 


Glu 


Thr 


Leu 


Leu 


Phe 


Leu 


Glu 


1 








5 








Gly 


Leu 


Lys 


Thr 


Phe 


Tyr 


Pro 


Thr 










20 








Leu 


He 


Met 


Gly Val 


Leu 


Gly Leu 










35 








Pro 


Glu 


Ser 


Ala 


Met 


Ser 


Ala 


Tyr 










50 








lie 


Met 


Leu 


Ser 


Gin 


Leu 


Thr 


Phe 










65 








Phe 


His 


Ala 


Gly 


Pro 


He 


Ser 


Phe 










80 








Cys 


Val 


Ala 


Leu 


Pro 


Lys 


Ala 


Asn 










95 








Leu 


Thr 


Val 


Val 


Val 


Ala 


Leu 


Arg 










110 








Ser 


Phe 


Asn 


Gin 


Tyr 


Pro 


He 


Glu 










125 








He 


He 


Leu 


Gin 


Ala 


Phe 


Ser 


Leu 










140 








Leu 


He 


Phe 


Leu 


Gly 


Lys 


Lys 


He 










155 








Val 


Asn 


Ser 


Asn 


Gin 


Asp 


Leu 


He 










170 








Val 


Ser 


Ser 


Phe 


Phe 


Arg 


Ser 


Cys 










185 








Arg 


Thr 


He 


He 


Gin 


Asp 


Lys 


Ser 










200 








He 


Ser 


Gly Arg 


Arg 


Arg 


Ala 


Lys 










215 








Pro 


Asn 


Thr 


Asn 


He 


Tyr 


Arg 


Ser 










230 








He 


Thr 


He 


Pro 


Gly Val 


Lys 


He 










245 








Thr 


Phe 


Val 


Asn 


Val 


Tyr 


Tyr 


Leu 










260 








Val 


Asp 


Met 


Val 


Lys 


Val 


Pro 


Leu 





115 










120 


Asp 


Phe 


Thr 


Arg 


Thr Arg 


Leu 




130 










135 


Thr 


Glu 


Arg 


Glu 


Phe 


Arg 


Gly 




145 










150 


Ser 


Lys 


Ser 


Asp 


Gly 


He 


Arg 




160 










165 


Ser 


Val 


Gin 


Ala 


He 


He 


He 




175 










180 


Tyr 


Asp 


Thr 


Ala 


Asn Gly 


Met 




190 










195 


He 


Leu 


Val 


Ser 


Trp Met 


Thr 




205 










210 


Gly Val 


Leu 


Ser 










220 












Ser 


Ala 


He 


Gly 


Ala 


He 


Ala 




10 










15 


Leu 


Val 


Phe 


Ser 


Ser 


Leu 


Gin 




25 










30 


Gly 


Phe 


He 


Ala 


Thr 


Tyr 


Leu 




40 










45 


Leu 


Ala 


Ala 


Val 


Ala 


Leu 


His 




55 










60 


He 


Phe 


Gly 


He 


Met 


He 


Ser 




70 










75 


Phe 


Tyr 


Asp 


He 


He 


Asn 


Tyr 




85 










90 


Ser 


Thr 


Ser 


He 


Leu 


Val 


Phe 




100 










105 


He 


Asn 


Lys 


Cys 


He 


Arg 


lie 




115 










120 


Phe 


Pro 


Met 


Glu 


Leu 


Phe 


Leu 




130 










135 


Ser 


Leu 


Val 


Ser 


Ser 


Phe 


Leu 




145 










150 


Ala 


Ser 


Leu 


His 


Asn 


Tyr 


Ser 




160 










165 


Ala 


He 


Gly 


Leu 


Cys 


Asn 


Val 




175 










180 


Val 


Phe 


Thr 


Gly 


Ala 


He 


Ala 




190 










195 


Gly Gly 


Ser 


Thr 


Thr 


Val 


Cys 




205 










210 


He 


Leu 


Leu 


Leu 


Gly Gin 


He 




220 










225 


He 


Asn 


Asp 


Tyr 


Arg 


Glu 


He 




235 










240 


Phe 


Gin 


Cys 


Cys 


Ser 


Ser 


He 




250 










255 


Lys 


His 


Lys 


Leu 


Leu 


Lys 


Glu 




265 










270 


Lys 


Glu 


Glu 


Glu 


He 


Phe 


Ser 
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275 



Leu 


Phe Asn 


Ser 


Ser 


Asp 


Thr 








290 






Arg 


Cys Phe 


Cys 


Asn 


Cys 


Asp 








305 






Leu 


Tyr Thr 


Glu 


Arg 


Phe 


Glu 








320 






Ser 


Val Asn 


Leu 


lie 


His 


Cys 








335 






Ser 


Gin Thr 


Ala 


Ser 


Glu 


Asp 








350 






Val 


Ser Gin Lys 


Asn 


Gin 


Gly 








365 






Val 


Trp Leu 


Pro 


Asn 


Asn 


Ser 








380 






Pro 


Asp Val Ala 


Glu 


Ser 


Gin 








395 






Ser 


Asp Ala 


Ser 


Leu 


Leu 


Pro 








410 






Phe 


Ser Met 


Val 


His 


Tyr 


Val 








425 






Arg 


Gin Val 


Ser 


Thr 


Glu 


Glu 








440 






Leu 


Leu Pro 


Ser 


Gin 


Pro 


His 



455 



280 




285 


Asn Leu Gin Gly Gly Lys 


He 


Cys 


295 




300 


Asp Leu Glu Pro Leu Pro 


Arg 


He 


310 




315 


Asn Lys Leu Asp Pro Glu 


Ala 


Ser 


325 




330 


Ser His Phe Glu Ser Met 


Asn 


Thr 


340 




345 


Gin Val Pro Tyr Thr Val 


Ser 


Ser 


355 




360 


Gin Gin Tyr Glu Glu Val Glu Glu 


370 




375 


Ser Arg Asn Ser Ser Pro Gly Leu 


385 




390 


Gly Arg Arg Ser Leu lie 


Pro 


Tyr 


400 




405 


Ser Val His Thr He He 


Leu 


Asp 


415 




420 


Asp Ser Arg Gly Leu Val 


Val 


Leu 


430 




435 


Ala Leu Ala Gly Ala Leu 


He 


Pro 


445 




450 


Pro Asp Pro Asp 






460 







<210> 13 
<211> 502 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3788427CD1 



<400> 13 

Met Ser He Val Arg Leu Ser Val His Ala Lys Trp lie Met Gly 
1 5 10 15 

Lys Val Thr Gly Thr Lys. Met Gin Lys Thr Ala Lys Val Arg Val 
20 25 30 

He Arg Leu Val Leu Asp Pro His Leu Leu Lys Tyr Tyr Asn Lys 
35 40 45 

Gin Lys Thr Tyr Phe Ala His Asn Ala Leu Gin Gin Cys Thr He 
50 55 60 

Gly Asp lie Val Leu Leu Lys Ala Leu Pro Val Pro Arg Thr Lys 
65 70 75 

His Val Lys His Glu Leu Ala Glu lie Val Phe Lys Val Gly Lys 
80 85 90 

Leu Val Asp Pro Val Thr Gly Lys Pro Arg Ala Gly Thr Thr Tyr 
95 100 105 

Leu Glu Ser Pro Leu Ser Ser Glu Thr Thr Gin Gly Val Asp Gly 
HO 115 120 

Ala Ser Arg Pro Ser Arg Gly Pro Ala Pro Cys Arg Ala Gly Pro 
125 130 135 

Gly Ala Arg Arg Leu Arg Pro Trp Pro Glu Ser Pro Arg Pro Glu 
140 145 150 

Pro Arg Gly Leu Pro Gly Pro Gly Arg Gly Ser Met Ala Thr Trp 
155 160 165 

Arg Arg Asp Gly Arg Leu Thr Gly Gly Gin Arg Leu Leu Cys Ala 
170 175 180 

Gly Leu Ala Gly Thr Leu Ser Leu Ser Leu Thr Ala Pro Leu Glu 
185 190 195 

Leu Ala Thr Val Leu Ala Gin Val Gly Val Val Arg Gly His Ala 
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tjiy 






200 






205 






210 


Arg 


Pro 


Trp 


Ala Thr 


Gly 


His Arg Val Trp Arg Ala Glu Gly 






Ala 




215 






220 






225 


Leu 


Arg 


Leu 


Trp Lys Gly Asn Ala Val Ala Cys 


Leu 


Arg 


Leu 










230 






235 




240 




Pro 


Cys 


Ser 


Ala Val 
245 


Gin 


Leu Ala 


Ala Tyr Arg 
250 


Lys 


Phe 


Val 
255 


val 


Leu 


pne 


Tnr 


Asp Asp 


Leu 


Gly His 


He Ser Gin 


Trp 


Ser 


Ser 


He 




Ala 


Gly 


260 






265 




270 


Met 


Ser Leu Ala Gly Met 


Val Ser Thr 


He 


Val 


Thr 










275 






280 






285 


Tyr 


Pro 


Thr 


Asp 


Leu He 


Lys 


Thr Arg 


Leu He Met 


Gin 


Asn 


He 




Glu 






290 






295 






300 


Leu 


Pro 


Ser 


Tyr Arg Gly Leu Leu 


His Ala Phe 


Ser 


Thr 


He 










305 






310 






315 


Tyr 


Gin 


Gin 


Glu 


Gly Phe 


Leu 


Ala Leu 


Tyr Arg Gly Val 


Ser 


Leu 










320 






325 






330 


Thr 


Val 


Val 


Gly Ala Leu 


Pro 


Phe Ser 


Ala Gly Ser 


Leu 


Leu 


Val 










335 






340 






345 


Tyr 


Met 


Asn 


Leu 


Glu Lys 


He 


Trp Asn Gly Pro Arg Asp Gin Phe 










350 






355 






360 


Ser 


Leu 


Pro 


Gin 


Asn Phe 


Ala 


Asn Val 


Cys Leu Ala 


Ala 


Ala 


Val 


Thr 








365 






370 






375 


Gin 


Thr 


Leu 


Ser Phe 


Pro 


Phe Glu 


Thr Val Lys 


Arg 


Lys 


Met 


Gin 








380 






385 




390 


Ala 


Gin 


Ser 


Pro Tyr 


Leu 


Pro His 


Ser Gly Gly Val 


Asp Val 


His 








395 






400 






405 


Phe 


Ser Gly Ala Val 


Asp 


Cys Phe 


Arg Gin He 


Val 


Lys 


Ala 


Gin 








410 






415 




420 


Gly Val 


Leu 


Gly Leu 


Trp 


Asn Gly 


Leu Thr Ala 


Asn 


Leu 


Leu 










425 






430 






435 


Lys 


He 


Val 


Pro 


Tyr Phe 
440 


Gly 


He Met 


Phe Ser Thr 
445 


Phe 


Glu 


Phe 
450 


Cys 


Lys 


Arg 


He 


Cys Leu 
455 


Tyr 


Gin Asn 


Gly Tyr He 
460 


Leu 


Ser 


Pro 
465 


Leu 


Ser 


Tyr 


Lys 


Leu Thr 
470 


Pro 


Gly Val 


Asp Gin Ser 
475 


Leu 


Gin 


Pro 
480 


Gin 


Glu 


Leu 


Arg 


Glu Leu 


Lys 


Lys Phe 


Phe Lys Thr 


Arg 


Lys 


Leu 










485 






490 


495 


Lys 


Pro 


Lys 


Lys 


Pro Thr 
500 


Leu 













<210> 14 

<211> 261 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: 6972455CD1 



<400> 14 



Met 


Thr 


Lys 


Arg 


Tyr 


Ser 


Ala 


Leu 


Leu 


Thr 


Ala 


Leu 


Phe 


Ala 


Ser 


1 








5 










10 










15 


Leu 


Met 


Leu 


Ser 


Gin 


Ala 


Pro 


Ala 


Gin 


Ala 


Ser Gly Leu Asp Asp 










20 










25 










30 


He 


Val 


Ala 


Arg 


Gly Thr 


Leu 


Lys 


Val 


Ala 


Val 


Pro 


Gin 


Asp 


Phe 










35 










40 








45 


Pro 


Pro 


Phe 


Gly 


Ser 


Val 


Gly 


Pro 


Asp 


Met 


Gin 


Pro 


Arg 


Gly 


Leu 




He 






50 










55 






60 


Asp 


Asp 


Thr 


Ala 


Lys 


Leu 


Leu 


Ala 


Asp 


Gin 


Leu 


Lys 


Val 


Lys 










65 










70 








75 


Leu 


Glu 


Leu 


Thr 


Pro 


Val 


Asn 


Ser 


Thr 


Asn 


Arg 


He 


Pro 


Phe 


Leu 
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80 






Thr 


Thr 


Gly Lys 


Val 


Asp Leu 


Val 








95 






Pro 


Glu 


Arg Ala 


Lys 


Val He 


Asp 








110 






Phe 


Tyr 


Leu Ala 


Val 


Phe Gly 


Pro 








125 






Leu 


Asp 


Asp Leu 


Lys 


Gly Lys Thr 








140 






He 


Glu 


Asp He 


Glu 


Leu Thr 


Ala 








155 






He 


Lys 


Arg Phe 


Glu 


Asp Asn Asn 








170 






Ala 


Gly 


Gin Val 


Asp 


Leu He 


Ala 








185 






Ala 


He 


Ser Glu 


Arg 


Asn Pro 


Lys 








200 






Lys 


Leu 


Lys Asp 


Ser 


Pro Val 


Tyr 








215 






Pro 


Ala 


Leu Leu 


Glu 


Lys Val 


Asn 








230 






Ala 


Asp 


Gly Ser 


Leu 


Gly Lys 


Asn 








245 






Pro 


Leu 


Pro Ala 


Asp 


Leu 










260 







<210> 15 
<211> 570 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 8077668CD1 

<400> 15 

Met Tyr Asp Pro Pro Lys Val Asn 

1 5 
His Ser Asp Glu Ser Val Glu He 
20 

Gly Phe Asp Gin Gly Arg Trp He 
35 

Ala Arg Arg His Thr Gly Leu Ala 
50 

Arg Val Cys Val Gly Cys Gin Gly 
65 

Lys Arg Glu Gly Arg Gly Ser Ala 
80 

Asp Gly Gin Gin Gly Glu Arg Arg 
95 

Asp Ala Val Thr Phe Gin Ser Ser 
110 

He Pro Ala Cys Trp Arg Met Thr 
125 

Leu Phe His Glu Ala Ala Gin Gin 
140 

Pro Trp Trp Lys Thr Gin Leu Phe 
155 

Gly Thr Trp Asp Gly Val Phe Thr 
170 

Gly Val Val Leu Phe Leu Arg Thr 
185. 

Gly Val Leu Leu Gly Met Phe Leu 





85 










90 


He 


Ser 


Ser 


Leu 


Gly 


Lys 


Asn 




100 










105 


Phe 


Ser 


Asn 


Ala 


Tyr 


Ala 


Pro 




115 










120 


Pro 


Asp 


Ala 


Ala 


He 


Ala 


Ser 




130 










135 


He 


Ser 


Val 


Thr 


Arg 


Gly 


Ala 




145 










150 


Val 


Ala 


Pro 


Lys 


Glu 


Ala 


Thr 




160 










165 


Ser 


Thr 


He 


Ala 


Ala 


Tvr 


Leu 




175 










180 


Ser Gly 


Asn 


Val 


Val 


Met 


Val 




190 










195 


Arg Val 


Pro 


Ala 


Leu 


Lys 


Val 




9n^ 

Z U J 










210 


val 




Val 


Asn 


Lys 


Asn 


Glu 




990 

z Z VJ 










225 




lie 


Leu 


Val 


Ala 


Ala 


Lys 














240 






Gin 


Trp 


Leu 


Lys 


Glu 




950 










255 


Lys 


Lys 


Asn 


Asp 


Ser 


Ala 


Asp 




10 










15 


Gin 


Ser 


Ala 


Leu 


His 


He 


Cys 




25 










30 


Leu 


Ser 




Arg 


Arg 


Ser 


Pro 




40 










45 


Pro 


Val 


Leu 


Glv 


Val 


Arg 


Val 




55 










60 


Ala 


Arg 


Val 


Pro 


Ala 


Trr> 


Pro 




70 










75 


Glu 


Glu 


Pro 


Val 


Glu 


Arcr 


Lvs 




85 








Thr 


90 


Gly Glu 


Ala 


Ser 


Leu 


Lvs 




100 










105 


Leu 


Pro 


Val 


Gin 


He 


Thr 


Aro 




115 










120 


Gin 


Met 


Ser 


Gin 


Val 


Gin 


Glu 




130 










135 


Asp 


Ala 


Leu 


Ala 


Gin 


Pro 


Gin 




145 










150 


Met 


Trp 


Glu 


Pro 


Val 


Leu 


Phe 




160 










.165 


Ser Cys 


Met 


lie 


Asn 


He 


Phe 




175 










180 


Gly Trp 


Leu 


Val 


Gly 


Asn 


Thr 




190 










195 


Val 


Ser 


Phe 


Val 


He 


Leu 


Val 
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Ala Leu 






200 






205 










210 


Val 


Thr 


Val 


Leu Ser 


Asp 


He Gly Val 


Gly Glu 


Arg 


Ser 








215 






220 








225 


Ser lie Gly Ser Gly Gly Val 


Tyr 


Ser Met 


He 


Ser 


Ser 


Val 


Leu 








230 






235 










240 


Gly Gly Gin 


Thr 


Gly Gly Thr 


lie 


Gly Leu 


Leu 


Tyr Val 


Phe 


Gly 








245 






250 










255 


Gin Met 


Tyr 


He 


Thr 


Gly Phe 


Ala 


Glu Ser 


He 


Ser 


Asp 


Leu 


Leu 


Gly . Leu 






260 






265 








270 


Gly Asn 


He 


Trp Ala 


Val 


Arg Gly He 


Ser 


Val 


Ala 


Val 








275 






280 










285 


Leu Leu 


Ala 


Leu 


Leu 


Gly He 


Asn 


Leu Ala 


Gly Val 


Lys 


Trp 


He 


lie Arg 






290 






295 






300 


Leu 


Gin 


Leu 


Leu Leu 


Leu 


Phe Leu 


Leu 


Ala 


Val 


Ser 


Thr 








305 






310 










315 


Leu Asp 


Phe 


Val 


Val 


Gly Ser 


Phe 


Thr His 


Leu 


Asp 


Pro 


Glu 


His 


Gly Phe 






320 






325 








330 


lie Gly Tyr 


Ser Pro 


Glu 


Leu Leu 


Gin 


Asn 


Asn 


Thr 


Leu 








335 






340 










345 


Pro Asp 


Tyr 


Ser 


Pro 


Gly Glu 


Ser 


Phe Phe 


Thr 


Val 


Phe 


Gly Val 


Phe Phe 






350 






355 










360 


Pro 


Ala 


Ala 


Thr Gly Val 


Met Ala 


Gly 


Phe 


Asn 


Met 


Gly 


Gly Asp 






365 






370 










375 


Leu 


Arg 


Glu 


Pro Ala 


Ala 


Ser He 


Pro 


Leu 


Gly 


Ser 


Leu 


Ala Ala 






380 






385 








390 


Val 


Gly 


He 


Ser Trp 


Phe 


Leu Tyr He Gly Tyr Arg 


Ser 








395 






400 










405 


His Gly Arg 


Leu 


Gin 


His Gly Gly Arg Pro 


Gin 


Gly Ala 


Cys 


Arg 


Gin His 






410 






415 








420 


Ser 


Pro 


Gly 


Leu Pro 


Gly 


Ser Cys 


Trp 


His 


Leu 


Val 


Ser 








425 






430 










435 


Ser Pro 


His 


Gly Asn 


Gin Ser 


Glu 


Leu Glu 


Ser 


Gly Arg Glu Asn 








440 






445 










450 


Pro Arg 


Gin 


Lys 


Arg 


Asp Cys 


Pro 


Phe Cys 


Ser 


Glu 


Ser 


Ala 


Pro 


Thr Val 






455 






460 










465 


Thr 


Asn 


Leu 


Phe His 


He 


Pro He 


Leu 


Leu 


Leu 


Thr 


Leu 








470 






475 










480 


Gly Gly Leu 


Glu 


Asn 


Ser Arg 


Leu 


Val Gly 


Ser 


Asp 


Thr 


Ala 


Leu 


Ser Cys 






485 






490 










495 


Ala 


Leu 


Pro 


Gly Asn Ala Ala Glu 


Glu 


Glu 


Cys 


Arg 


Asp 


Pro Asp 


His 




500 






505 






510 


Arg 


Ser 


Asn Pro 


Phe 


Phe Pro 


Phe 


Leu Gly Leu 


Pro 








515 






520 










525 


Pro Pro 


Ser 


Leu 


Pro 


His Pro 


His 


Phe Glu 


Leu Gly Asp 


Phe 


Phe 


Asn His 






530 






535 










540 


Arg 


Val 


Ser 


Arg He 


Thr 


Gly Gly Ser 


He 


Lys 


Met 


Lys 


Ser Phe 






545 






550 








555 


Ala 


He 


Arg Arg He Gly Ala Ala 


Ser 


Gin 


Leu 


Cys 


Arg 








560 






565 








570 



<210> 16 

<211> 1033 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 55120485CD1 

<400> 16 

Met Glu Glu Asn Ser Lys Lys Asp His Arg Ala Leu Leu Asn Gin 
15 10 15 
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Gly 


Glu 


Glu 


Asp Glu 


Leu 


Glu 


Val 


Phe Gly 


Tyr 


Arg 


Asp 


His 


Asn 










20 










25 










30 


Val 


Arg 


Lys 


Ala 


Phe Cys 


Leu 


Val 


Ala 


Ser 


Val 


Leu 


Thr 


Cys 


Gly 










35 










40 










45 


Gly 


Leu 


Leu 


Leu 


Val 


Phe Tyr Trp Arg Pro 


Gin 


Trp 


Arg 


Val 


Trp 










50 










55 










60 


Ala 


Asn 


Cys 


lie 


Pro Cys 


Pro 


Leu 


Gin 


Glu 


Ala 


Asp 


Thr 


Val 


Leu 










65 










70 










75 


Leu 


Arg 


Thr 


Thr Asp Glu 


Phe Gin Arg Tyr 


Met 


Arg 


Lys 


Lys 


Val 










80 










85 










90 


Phe 


Cys 


Leu 


Tyr 


Leu 


Tyr 


Thr 


Leu 


Lys 


Phe 


Pro 


Val 


Ser 


Lys 


Lys 










95 










100 










105 


Trp 


Glu 


Glu 


Ser Leu Val Ala Asp Arg His 


Ser 


Val 


He 


Asn 


Gin 










110 










115 










120 


Ala 


Leu 


He 


Lys 


Pro 


Glu 


Leu Lys 


Leu Arg 


Cys 


Leu 


Glu 


Val 


Gin 










125 










130 










135 


Lys 


He 


Arg 


Tyr 


Val 


Trp 


Asn 


Asp 


Leu 


Glu 


Lys 


Arg 


Phe 


Gin 


Lys 










140 










145 










150 


Val 


Gly 


Leu 


Leu 


Glu 


Asp 


Ser 


Asn 


Ser Cys 


Ser 


Asp 


He 


His 


Gin 










155 










160 










165 


Thr 


Phe 


Gly 


Leu 


ciy 


Leu 


Thr 


Ser 


Glu 


Glu 


Gin 


Glu 


Val 


Arg 


Arg 










170 










175 










180 


Leu 


Val 


Cys 


Gly 


Pro 


Asn 


Ala 


He 


Glu 


Val 


Glu 


He 


Gin 


Pro 


lie 










185 










190 










195 


Trp 


Lys 


Leu 


Leu 


Val 


Lys 


Cln 


Val 


Leu 


Asn 


Pro 


Phe 


Tyr 


Val 


Phe 










200 










205 










210 


Gin 


Ala 


Phe 


Thr 


Leu 


Thr 


Leu 


Trp 


Leu 


Ser 


Gin 


Gly 


Tyr 


He 


Glu 










215 










220 










225 


Tyr 


Ser 


Val 


Ala 


He 


lie 


lie 


Leu 


Thr 


Val 


lie 


Ser 


He 


Val 


Leu 










230 










235 










240 


Ser 


Val 


Tyr 


Asp 


Leu 


Arg 


Gin 


Gin 


Ser 


Val 


Lys 


Leu 


His 


Asn 


Leu 










245 










250 










255 


Val 


Glu 


Asp 


His 


Asn 


Lys 


Val 


Gin 


Val 


Thr 


He 


He 


Val 


Lys 


Asp 










260 










265 










270 


Lys 


Gly 


Leu 


Glu 


Glu 


Leu 


Glu 


Ser Arg Leu 


Leu 


Val 


Pro 


Gly 


Asp 










275 










280 










285 


He 


Leu 


lie 


Leu 


Pro Gly 


Lys 


Phe 


Ser 


Leu 


Pro 


Cys 


Asp 


Ala 


Val 










290 










295 








300 


Leu 


He 


Asp 


Gly Ser Cys 


Val 


Val 


Asn 


Glu 


Gly 


Met 


Leu 


Thr 


Gly 










305 










310 










315 


Glu 


Ser 


He 


Pro 


Val 


Thr 


Lys 


Thr 


Pro 


Leu 


Pro 


Gin 


Met 


Glu 


Asn 










320 










325 










330 


Thr 


Met 


Pro 


Trp 


Lys 


Cys 


His 


Ser 


Leu 


Glu 


Asp 


Tyr 


Arg 


Lys 


His 










335 










340 










345 


Val 


Leu 


Phe 


Cys 


Gly Thr 


Glu 


Val 


He 


Gin 


Val 


Lys 


Pro 


Ser 


Gly 










350 










355 










360 


Gin 


Gly 


Pro 


Val 


Arg 


Ala 


Val 


Val 


Leu 


Gin 


Thr 


Gly 


Tyr 


Asn 


Thr 










365 










370 










375 


Ala 


Lys 


Gly 


Asp 


Leu 


Val 


Arg 


Ser 


He 


Leu 


Tyr 


Pro 


Arg 


Pro 


Leu 










380 










385 










390 


Asn 


Phe 


Lys 


Leu Tyr 


Ser Asp Ala 


Phe 


Lys 


Phe 


lie 


Val 


Phe 


Leu 










395 










400 










405 


Ala 


Cys 


Leu 


Gly Val Met 


Gly Phe 


Phe Tyr 


Ala 


Leu 


Gly 


Val 


Tyr 










410 










415 










420 


Met 


Tyr 


His 


Gly Val 


Pro 


Pro 


Lys 


Asp Thr 


Val 


Thr 


Met 


Ala 


Leu 










425 










430 










435 


He 


Leu 


Leu 


Thr 


Val 


Thr 


Val 


Pro 


Pro 


Val 


Leu 


Pro 


Ala 


Ala 


Leu 










440 










445 










450 


Thr 


He 


Gly 


Asn 


Val 


Tyr 


Ala 


Gin 


Lys 


Arg 


Leu 


Lys 


Lys 


Lys 


Lys 










455 










460 










465 


lie 


Phe 


Cys 


He 


Ser 


Pro Gin Arg 


He 


Asn 


Met 


Cys 


Gly 


Gin 


lie 










470 










475 










480 


Asn 


Leu 


Val 


Cys 


Phe Asp Lys Thr Gly Thr 


Leu 


Thr 


Glu 


Asp 


Gly 
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485 










490 










495 


Leu 


Asp 


Leu Trp Gly Thr Val 


Pro 


Thr 


Ala 


Asp Asn 


Cys 


Phe 


Gin 




Ala 


His Ser 


500 










505 








510 


Glu 


Phe 


Ala 


Ser Gly Gin Ala 


Val 


Pro 


Trp 


Ser 


Pro 






Ala Ala 


515 










520 








525 


Leu 


Cys 


Met 


Ala 


Ser 


Cys 


His 


Ser 


Leu 


He 


Leu 


Leu 


Asn 






He Gin 


530 










535 










540 


Gly Thr 


Gly Asp 


Pro 


Leu 


Asp 


Leu 


Lys 


Met 


Phe 


Glu 


Gly 




Ala 




545 










550 










555 


Thr 


Trp Lys 


Met 


Glu 


Asp 


Cys 


He 


Val 


Asp 


Ser 


Cys 


Lys 


Phe 








560 










565 








570 


Gly 


Thr 


Ser Val 


Ser 


Asn 


He 


He 


Lys 


Pro 


Gly 


Pro 


Lys 


Ala 


Ser 








575 










580 








585 


Lys 


Ser 


Pro Val 


Glu 


Ala 


He 


He 


Thr 


Leu 


Cys 


Gin 


Phe 


Pro 


Phe 


Ser 






590 










595 










600 


Ser 


Ser Leu 


Gin 


Arg Met 


Ser 


Val 


He 


Ala 


Gin 


Leu 


Ala 


Gly 


Glu 






605 










610 










615 


Asn 


His Phe 


His 


Val 


Tyr 


Met 


Lys 


Gly Ala 


Pro 


Glu 


Met 


Val 


Ala 






620 










625 










630 


Arg 


Phe Cys 


Arg 


Ser 


Glu 


Thr 


Val 


Pro 


Lys 


Asn 


Phe 


Pro 


Gin 


Glu 






635 










640 










645 


Leu 


Arg Ser 


Tyr 


Thr 


Val 


Gin 


Gly 


Phe 


Arg 


Val 


He 


Ala 


Leu 


Ala 


His 




650 










655 










660 


Lys Thr 


Leu 


Lys Met Gly Asn Leu 


Ser 


Glu 


Val 


Glu 


His 




Ala 




665 










670 










675 


Leu 


Arg Glu 


Lys 


Val 


Glu 


Ser 


Glu 


Leu 


Thr 


Phe 


Leu 


Gly 


Leu 




He 




680 










685 








690 


Leu 


Met Glu 


Asn 


Arg 


Leu 


Lys 


Lys 


Glu 


Thr 


Lys 


Leu 


Val 


Leu 








695 










700 








705 


Lys 


Glu 


Leu Ser 


Glu 
710 


Ala 


Arg 


He 


Arg 


Thr 
715 


Val 


Met 


He 


Thr 


Gly 
720 


Asp 


Asn 


Leu Gin 


Thr 


Ala 


He 


Thr 


Val 


Ala 


Lys 


Asn 


Ser 


Glu 


Met 


He 






725 










730 










735 


Pro 


Pro Gly Ser 


Gin 


Val 


He 


He 


Val 


Glu 


Ala 


Asp 


Glu 


Pro 


Glu 


Glu 




740 










745 








750 


Phe Val 


Pro 
755 


Ala 


Ser 


Val 


Thr 


Trp 
760 


Gin 


Leu 


Val 


Glu 


Asn 
765 


Gin 


Glu 


Thr Gly Pro Gly Lys Lys Glu He 


Tyr 


Met 


His 


Thr 


Gly 








770 










775 










780 


Asn 


Ser 


Ser Thr 


Pro 


Arg Gly Glu 


Gly Gly 


Ser 


Cys 


Tyr 


His 


Phe 


Ala 






785 










790 








795 


Met 


Ser Gly 


Lys 


Ser 


Tyr 


Gin 


Val 


He 


Phe 


Gin 


His 


Phe 


Asn 








800 










805 








810 


Ser 


Leu 


Leu Pro 


Lys 


He 


Leu 


Val 


Asn 


Gly Thr 


Val 


Phe 


Ala 


Arg 








815 










820 










825 


Met 


Ser 


Pro Gly Gin 


Lys 


Ser 


Ser 


Leu 


He 


Glu 


Glu 


Phe 


Gin 


Lys 








830 










835 










840 


Leu 


Asn 


Ala Cys 


Thr 
845 


Val 


Gin 


Asn 


Glu 


Ser 
850 


He 


Ser 


Glu 


Leu 


Thr 
855 


Met 


Ser 


Pro Thr 


Ala 


Pro 


Glu 


Lys 


Met 


Glu 


Ser 


Asn 


Ser 


Thr 


Phe 


Thr 






860 










865 










870 


Ser 


Phe Glu 


Asn 


Thr 


Thr 


Val 


Trp 


Phe 


Leu 


Gly Thr 


He 


Asn 








875 










880 










885 


Cys 


He 


Thr Val 


Ala 


Leu 


Val 


Phe 


Ser 


Lys 


Gly Lys 


Pro 


Phe 


Arg 


Gin 






890 










895 










900 


Pro 


Thr Tyr 


Thr 


Asn 


Tyr 


He 


Phe 


Val 


Leu 


Val 


Leu 


He 


He 


Gin 






905 










910 










915 


Leu Gly Val 


Cys 


Leu 


Phe 


He 


Leu 


Phe 


Ala 


Asp 


He 


Pro 


Glu 


Leu 






920 










925 








930 


Tyr 


Arg Arg 


Leu 


Asp 


Leu 


Leu 


Cys 


Thr 


Pro 


Val 


Leu 


Trp 


Arg 


Ala 






935 










940 








945 


Ser 


He Val 


He 
950 


Met 


Leu 


Ser 


Leu 


Asn 
955 


Phe 


He 


Val 


Ser 


Leu 
960 
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Val Ala Glu 


Glu 


Ala 


Val He Glu Asn Arg 


Ala Leu Trp Met Met 






965 






970 




975 


lie Lys Arg 


Cys 


Phe 


Gly Tyr" Gin Ser Lys 


Ser Gin Tyr Arg He 






980 






985 




990 


Trp Gin Arg 


Asp 


Leu 


Ala Asn Asp 


Pro Ser 


Trp Pro Pro Leu 


Asn 






995 






1000 


1005 


Gin Thr Ser 


His 


Ser 


Asp Met 


Pro 


Glu Cys 


Gly Arg Gly Val 


Ser 




1010 






1015 


1020 


Tyr Ser Asn 


Pro 


Val 


Phe Glu 


Ser 


Asn Glu 


Glu Gin Leu 




1025 






1030 






<210> 17 
















<211> 496 
















<212> PRT 
















<213> Homo sapiens 












<220> 
















<221> misc_f eature 












<223> Incyte ID 


No: 


3112883CD1 








<400> 17 
















Met Asp Cys 


Tyr Arg 


Thr Ser 


Leu 


Ser Ser 


Cot* Tm T 1 P TKry 
ijCi, XXkJ X JL t* 1 J 1 


Pro 


1 




5 






10 




15 


Thr Val lie 


Leu 


Cys 


Leu Phe 


Gly Phe Phe 


Cpy Mot" Mot" A ttt 
ncL ncu /uy 


C X \J 






20 






25 




30 


Ser Glu Pro 


Phe 


Leu 


He Pro 


Tyr 


Leu Ser 




Asn 






35 






40 




45 


Leu Thr Ser 


Ala 


Glu 


He Thr 


Asn 


Glu lie 


pVio Pro \7zk 1 Ttt> 
rile ti.u vctx iiy 


Thr 






50 






55 




60 


Tyr Ser Tyr 


Leu 


Val 


Leu Leu 


Leu 


Pro Val 


PV>o Va 1 T.oii Thr 
rue vctx ucu iiu 


Asp 






65 






70 




75 


Tyr Val Arg 


Tyr 


Lys 


Pro Val 


He 


He Leu 


fl"] v-i CZ"] \r T 1 o Cot* 


Phe 






80 






85 




90 

? \j 


lie lie Thr 


Trp Leu 


Leu Leu 


Leu 


Phe Gly 


Clin C2~\\r \7a ~\ Ta/q 
olli Vsxy vctx Jjyb 


Thr 

1 XiX 






95 






100 




105 


Met Gin Val 


Val 


Glu 


Phe Phe 


Tyr Gly Met 


Val Thr Ala Ala 
vctx 1 1 ix nia nia 


Glu 






110 






115 




120 


Val Ala Tyr 


Tyr 


Ala 


Tyr He 


Tyr 


Ser Val 


Val Ser Pro Glu 


His 






125 






130 




135 


Tyr Gin Arg 


Val 


Ser 


Gly Tyr 


Cys 


Arg Ser 


Val Thr Leu Ala 


Ala 






140 






145 




150 


Tyr Thr Ala 


Gly Ser 


Val Leu 


Ala 


Gin Leu 


Leii Val Ser Leu 


Ala 






155 






160 




165 


Asn Met Ser 


Tyr 


Phe 


Tyr Leu 


Asn 


Val He 


Ser Leu Ala Ser 


Val 






170 






175 




180 


Ser Val Ala 


Phe 


Leu 


Phe Ser 


Leu 


Phe Leu 


Pro Met Pro Lys 


Lvs 






185 






190 




195 


Ser Met Phe 


Phe 


His 


Ala Lys 


Pro 


Ser Arg 


Glu He Lys Lys 


Ser 






200 






205 




210 


Ser Ser Val 


Asn 


Pro 


Val Leu 


Glu 


Glu Thr 


His Glu Gly Glu 


Ala 






215 






220 




225 


Pro Gly Cys 


Glu 


Glu 


Gin Lys 


Pro 


Thr Ser 


Glu He Leu Ser 


Thr 






230 






235 




240 


Ser Gly Lys 


Leu 


Asn 


Lys Gly 


Gin 


Leu Asn 


Ser Leu Lys Pro 


Ser 






245 






250 




255 


Asn Val Thr 


Val 


Asp 


Val Phe 


Val 


Gin Trp 


Phe Gin Asp Leu 


Lys 






260 






265 




270 


Glu Cys Tyr 


Ser 


Ser 


Lys Arg 


Leu 


Phe Tyr 


Trp Ser Leu Trp 


Trp 






275 






280 




285 


Ala Phe Ala 


Thr 


Ala 


Gly Phe 


Asn Gin Val 


Leu. Asn Tyr Val 


Gin 






290 






295 




300 


He Leu Trp 


Asp Tyr 


Lys Ala 


Pro 


Ser Gin 


Asp Ser Ser He 


Tyr 






305 






310 




315 
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Asn 


Gly 


Ala 


Val 


Glu 
320 


Ala 


He 


Ala 


Phe 


Ala 


Val 


Gly 
335 


Tyr 


Val 


Glu 


Leu 


Ala 


Leu 


Val 
350 


Val 


Phe 


Phe 


Leu 


Met 


His 


Tyr 
365 


Thr 


Ala 


Tyr 


Leu 


He 


Phe 


Lys 
380 


Ser 


Ser 


Val 


Phe 


Gin 


He 


Ala 
395 


Val 


Asn 


Val 


Phe 


Gly 


He 


Asn 
410 


Thr 


Phe 


Met 


Thr 


Val 


He 


Val 


Val 


Asp 










425 




Ser 


He 


Gin 


Phe 


Leu 
440 


Val 


Tyr 


Gly 


He 


Phe 


Leu 


Met 
455 


Ara 


Ser 


Ser 


Gin 


Lys 


Asp 


Val 
470 


Gin 


Ser 


Val 


Ser 


His 


Pro 


Glu 
485 


Glu 


Glu 


Leu 















Ala 


Thr 


Phe 


Gly Gly Ala Val 


Ala 






325 




330 


Lys 


Val 


Asn 


Trp Asp Leu Leu 


Glv 






340 




345 


Ser 


Val 


Val Asn Ala Gly Ser 


Leu 






355 




360 


Asn 


He 


Trp 


Ala Cys Tyr Ala 


Glv 






370 




375 


Tyr 


Met 


Leu 


Leu He Thr He 


Ala 






385 




390 


Leu 


Asn 


Val 


Glu Arg Tyr Ala 


Leu 






400 




405 


He 


Ala 


Leu 


Val He Gin Thr 


lie 






415 




420 


Gin Arg Gly Leu Asn Leu Pro 


Val 






430 




4 *j c; 
*± j j 


Gly 


Ser 


Tyr 


Phe Ala Val He 


Ala 






445 




450 


Met 


Tyr 


He 


Thr Tyr Ser Thr 


Lys 






460 




465 


Pro 


Ala 


Pro 


Ser Glu Asn Pro 


Asp 






475 




480 


Ser 


Asn 


He 


He Met Ser Thr 


Lys 






490 




495 
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Met 


Thr 


Asn 


Ser 


Lys 


Gly Arg 


Ser 


He 


Thr 


Asp 


Lys 


Thr 


Ser 


Gly 


1 








5 
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15 


Gly 


Pro 


Ser 


Ser 


Gly Gly Gly 


Phe 


Val 


Asp 


Trp 


Thr 


Leu 


Arg 


Leu 










20 








25 








30 


Asn 


Thr 


He 


Gin 


Ser 
35 


Asp Lys 


Phe 


Leu 


Asn 
40 


Leu 


Leu 


Leu 


Ser 


Met 
45 


Val 


Pro 


Val 


He 


Tyr 
50 


Gin Lys 


Asn 


Gin 


Glu 
55 


Asp 


Arg 


His 


Lys 


Lys 
60 


Pro 


Asn 


Gly 


He 


Trp 
65 


Gin Asp 


Gly 


Leu 


Ser 
70 


Thr 


Ala 


Val 


Gin 


Thr 
75 


Phe 


Ser 


Asn 


Arg 


Ser 
80 


Glu Gin 


His 


Met 


Glu 
85 


Tyr 


His 


Ser 


Phe 


Ser 
90 


Glu 


Gin 


Ser 


Phe 


His 


Ala Asn 


Asn 


Gly His 


Ala 


Ser 


Ser 


Ser 


Cys 










95 








100 










105 


Ser 


Gin 


Lys 


Tyr 


Asp 
110 


Asp Tyr 


Ala 


Asn 


Tyr 
115 


Asn 


Tyr 


Cys 


Asp 


Gly 
120 


Arg 


Glu 


Thr 


Ser 


Glu 


Thr Thr 


Ala 


Met 


Leu 


Gin Asp Glu Asp 


He 










125 








130 










135 


Ser 


Ser 


Asp Gly 


Asp Glu Asp 


Ala 


He 


Val 


Glu 


Val 


Thr 


Pro 


Lys 










140 








145 










150 


Leu 


Pro 


Lys 


Glu 


Ser 
155 


Ser Gly 


He 


Met 


Ala 
160 


Leu 


Gin 


He 


Leu 


Val 
165 


Pro 


Phe 


Leu 


Leu 


Ala 


Gly Phe 


Gly 


Thr 


Val 


Ser 


Ala 


Gly Met 


Val 










170 








175 










180 


Leu 


Asp 


He 


Val 


Gin 
185 


His Trp 


Glu 


Val 


Phe 
190 


Arg 


Lys 


Val 


Thr 


Glu 
195 
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val 


Phe 


He 


Leu 


Val 


Pro 


Ala 


Leu 










200 








Glu 


Met 


Thr 


Leu 


Ala 


Ser 


Arg 


Leu 










215 








Lys 


Met 


Asp 


Ser 


Pro 


He 


Glu 


Lys 










230 








Leu 


Ala 


Leu 


Lys 


Gin 


Val 


Gin 


Ala 










245 








Ala 


Val 


Ala 


Ala 


lie 


He 


Leu 


Gly 










260 








Tyr 


Leu 


Asp 


His 


Ser 


lie 


Leu 


Leu 








275 








Ala 


Phe 


He 


Ala 


Ser 


Leu 


Leu 


Gin 










290 








He 


Val 


Gly 


Ser 


Lys 


Lys 


Thr 


Gly 










305 








Thr 


Pro 


lie 


Ala 


Ala 


Ser 


Phe 


Gly 










320 








Leu 


Ala 


Trp 


lie 


Ser 


Gin 


Gly 


Leu 










335 








Tyr 


Tyr 


He 


Ser 


Pro 


Leu 


Val 


Gly 










350 








Pro 


He 


Trp 


lie 


He 


lie 


Ala 


Ala 










365 








Val 


Leu 


His 


Ser 


Gly 


Trp Glu 


Pro 










380 








Ser 


Ser 


lie 


Gly 


Gly 


Leu 


He 


Leu 










395 








Asn 


Leu 


Val 


Gly 


He 


Val 


Val 


Tyr 










410 








Gly 


Gly 


Asn 


Leu 


Val 


Ala 


lie 


Gin 










425 








Leu 


His 


Leu 


His 


Ser 


lie 


Pro 


Gly 










440 








Gly 


Cys 


Tyr 


Tyr 


Pro 


Phe Arg 


Thr 










455 








Asn 


Lys 


Ser 


Ala 


Gin 


Val 


Leu 


Leu 










470 








Leu 


He 


Phe 


Leu 


Tyr 


Thr 


He 


His 










485 








Ser 


Leu 


Thr 


lie 


lie 


Phe 


He 


Val 










500 








Leu 


Gin 


Val 


Phe 


Thr 


Leu. 


Leu 


Trp 










515 








His 


Phe 


Trp 


Arg 


Lys 


Gly Lys 


Asp 










530 








Tyr 


Leu 


Thr 


Ala 


Leu 


Gly Asp 


Leu 










545 








Leu 


Ser 


Phe 


His 


Phe 


Leu 


.Trp 


Leu 










560 








Val 


Gly 


Asp 
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Leu Gly 


Leu 


Lys Gly Asn Leu 




205 




210 


Ser 


Thr 


Ala 


Val Asn He Gly 




220 




225 


Trp Asn 


Leu 


lie lie Gly Asn 




235 




240 


Thr 


Val 


Val 


Gly Phe Leu Ala 




250 




255 


Trp 


lie 


Pro 


Glu Gly Lys Tyr 




265 




270 


Cys 


Ser 


Ser 


Ser Val Ala Thr 




280 




285 


Gly He 


lie 


Met Val Gly Val 




295 




300 


lie 


Asn 


Pro 


Asp Asn Val Ala 




310 




315 


Asp 


Leu 


lie 


Thr Leu Ala lie 




325 




330 


Tyr 


Ser 


Cys 


Leu Glu Thr Tyr 




340 




345 


Val 


Phe 


Phe 


Leu Ala Leu Thr 




355 




360 


Lys 


His 


Pro 


Ala Thr Arg Thr 




370 




375 


Val 


lie 


Thr 


Ala Met Val lie 




385 




390 


Asp Thr 


Thr 


Val Ser Asp Pro 




400 




405 


Thr 


Pro 


Val 


lie Asn Gly lie 




415 




420 


Ala 


Ser 


Arg 


lie Ser Thr Tyr 




430 




435 


Glu 


Leu 


Pro 


Asp Glu Pro Lys 




445 




450 


Phe 


Phe 


Gly 


Pro Gly Val Asn 




460 




465 


Leu 


Leu 


Val 


lie Pro Gly His 




475 




480 


Leu 


Met 


Lys 


Ser Gly His Thr 




490 




495 


Val 


Tyr 


Leu 


Phe Gly Ala Val 




505 




510 


lie 


Ala 


Asp 


Trp Met Val His 




520 




525 


Pro 


Asp 


Ser 


Phe Ser He Pro 




535 




540 


Leu 


Gly 


Thr 


Ala Leu Leu Ala 




550 




555 


lie Gly 


Asp 


Arg Asp Gly Asp 




565 




570 



31/83 



3NSDOCID: <WO 024641 5A2_IA> 



Met Asp Ala Val -Lys Tyr Leu Asn Lys Leu Asn Leu Asp Asn He 
1 5 10 15 

Glu Leu Thr Lys Tyr Leu Phe Phe Thr Gly Lys Gly Gly Val Gly 
20 25 30 

Lys Thr Thr He Ser Ser Phe lie Ala Leu Asn Leu Ala Glu Asn 
35 40 45 

Gly Lys Lys Val Ala Leu Val Ser Thr Asp Pro Ala Ser Asn Leu 
50 55 60 

Gin Asp Val Phe Gin Met Glu Leu Ser Asn Lys Leu Thr Lys Tyr 
65 70 " A 75 

Gin Pro He Pro Asn Leu Ser He Ala Asn Phe Asp Pro lie Val 
80 85 90 

Ala Ala Asp Asp Tyr Lys Ala Gin Ser He Glu Pro Tyr Glu Gly 
95 100 105 

He Leu Pro Glu Asp Val Leu Ala Glu Met Lys Glu Gin Leu Ser 
HO 115 120 

Gly Ser Cys Thr Val Glu Val Ala Ala Phe Asn Glu Phe Thr Asn 
125 130 135 

Phe Leu Ser Asp Lys Thr Leu Glu Gin Glu Phe Asp Phe He He 
140 145 150 

Phe Asp Thr Ala Pro Thr Gly His Thr Leu Arg Met Leu Glu Leu 
155 160 165 

Pro Ser Ala Trp Thr Asp Tyr Leu Asn Thr Thr Ser Asn Asp Ala 
170 175 180 

Ser Cys Leu Gly Gin Leu Ser Gly Leu Asn Glu Asn Arg Val Lys 
185 190 195 

Tyr Asn Ser Ala Leu Glu Lys Leu Arg Asn Gin Asp Asp Thr Thr 
200 205 J 210 

Met Met Leu Val Ala Arg Pro Thr His Ser Ser He Tyr Glu He 
215 220 225 

Gin Arg Ala Gin Gin Glu Leu Gin Gin Leu Ser He Ser Lys Phe 
230 • 235 240 

Lys Val He He Asn Asn Tyr He Glu Glu Ser His Gly Leu He 
245 250 255 

Ser Ser Gin Met Lys Ser Glu Gin Asp Lys Asn He Asn His Phe 
260 265 270 

Thr Glu Trp Leu Asn Asn Asn His Ala Tyr Tyr Val Pro Tyr Lys 
275 280 285 

Lys Gin Lys Glu Glu Gly He Glu Asn Leu Thr Asn Leu Leu Asn 
290 295 300 

Asp Asp Asn Leu He Glu Asn Asp Asp Phe He Val Glu Asp His 
305 310 315 

Pro Gin Phe Asn Lys Leu He Asp Glu He Glu Asn Ser Lys Val 
320 325 330 

Gin Tyr Leu Phe Thr Met Gly Lys Gly Gly Val Gly Lys Thr Thr 
335 340 " 345 

Val Ala Thr Gin Leu Ala Thr Ala Leu Ser Asn Lys Gly Tyr Arg 
350 355 360 

Val Leu Leu Ala Thr Thr Asp Pro Thr Lys Glu He Asn Val Glu 
365 370 375 

Thr Thr Ser Asn Leu Asn Thr Ala Tyr He Asp Glu Glu Gin Ala 
380 385 390 

Leu Glu Lys Tyr Lys Lys Glu Val Leu Ala Thr Val Asn Asp Asp 
395 400 405 

Thr Pro Gin Asp Asp lie Asp Tyr He Met Glu Asp Leu Lys Ser 
410 415 420 

Pro Cys Thr Glu Glu He Ala Phe Phe Lys Ala Phe Ser Asp He 
425 430 435 

Met Glu Asn Gin Asp Asp Met Asp Tyr Val lie Val Asp Thr Ala 
440 445 450 

Pro Thr Gly His Thr Leu Leu Leu Leu Asp Ser Ser Glu Asn His 
455 460 465 

His Arg Glu Leu Lys Lys Lys Ser Thr Gin Thr Thr Ser Asn Val 
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470 




Glu 


Thr 


Leu 


Leu 


Pro 
485 


Lys He Gin 


He 


He 


Val 


Thr 


Leu 
500 


Ala Glu Lys 


Arg 


Leu 


Val 


Glu 


Asp 
515 


Leu Asn Arg 


Trp 


Val 


Val 


Asn 


Gin 
530 


Ser Leu Val 


Leu 


Phe 


Ser 


Asn 


Lys Lys Glu Asp 










545 




He 


Lys 


Asn 


Glu 


Ser 
560 


Phe Asp Asn 


Gly 


Leu 


Ser 









<210> 20 

<211> 248 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 7483850CD1 

<400> 20 



Met 


Asn 


Ser Asn Leu Asn Leu Asp 


1 








5 


Val 


Met 


Asn 


Ala 


He Ser Ser Leu 










20 


Phe 


Leu 


Glu 


Tyr 


Phe Ser Thr Cys 










35 


Gly 


Ser 


Phe Leu Ser Thr Cys He 










50 


Ala 


Leu 


Ser 


Val 


Met He Ala Gly 










65 


His 


Phe 


Pro 


Ala 


Val Glu Gin Pro 










80 


Val 


Ser 


Ser 


Met 


Pro Cys Phe Tyr 










95 


Leu 


Gly 


Val 


Ala 


Glu Thr Leu Val 










110 


Ser 


Tyr 


Arg 


Phe 


Val Pro Ser Asn 










125 


Phe 


Leu 


Thr 


Leu 


Phe Asn Gly Phe 










140 


Leu 


Val 


Lys 


Leu 


Val Tyr Leu He 










155 


Trp 


Phe 


Pro 


Asn 


Thr Leu Asn Lys 










170 


Phe 


Phe 


Leu 


Ala 


Ser Leu Thr Leu 










185 


Ser 


Val 


Ser Gin Arg Tyr Cys Asn 










200 


Asn 


He 


Arg Gly Ser Asn Leu Glu 










215 


Lys 


Ser 


Leu 


Lys 


Phe Tyr Gly Ser 










230 


He 


Asp 


Leu Trp Glu Thr Ala Leu 










245 



<210> 21 
<211> 761 





475 




480 


Asn 


Lys 


Asn Leu Thr Gin 


Met 




490 




495 


Thr 


Pro 


Tyr Leu Glu Ser 


Lys 




505 




510 


Ala 


Asn 


He Gly His Asn 


Trp 




520 




525 


Thr 


Leu 


Asn Gin Arg Asp 


Asp 




535 




540 


Glu 


Ser 


Phe Trp He Asn 


Lys 




550 




555 


Tyr 


Phe 


Val lie Pro Tyr 


Gly. 




565 




570 


Gly Phe 


Leu Leu Pro He 


Ala 




10 




15 


Pro 


Leu 


Leu He Leu Ala 


Pro 




25 




30 


Leu 


Phe 


Pro Ser Lys Arg 


Val 




40 




45 


He 


Ala 


Gly Asn Leu Phe 


Ala 




55 




60 


Phe 


Phe 


Glu He His Arg 


Lvs 




70 




75 


Leu 


Ser 


Gly Lys Val Leu 


Thr 




85 




90 


Leu 


He 


Leu Gin Tyr Val 


Leu 




100 




105 


Asn 


Pro 


Ala Leu Ser Val 


He 




115 




120 


Val 


Arg 


Gly Thr Ser Met 


Asn 




130 




135 


Gly Cys 


Phe Thr Gly Ala 


Leu 




145 




150 


Ser 


Glu 


Gly Lys Asn Arg 


Gin 




160 




165 


Gly Asn 


Leu Glu Gly Phe 


Phe 




175 




180 


Leu 


Asn 


Val Leu Gly Phe 


Cys 




190 




195 


Leu 


Asn 


His Phe Asn Ala 


Gin 




205 




210 


Glu 


Thr 


Leu Leu Leu His 


Glu 




220 




225 


He 


Gin 


Glu Phe Ser Ser 


Ser 




235 




240 
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<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5508353CD1 

<400> 21 



Met 


Lys 


Pro 


Ala 


Ser 


Pro 


Val 


Glu 


1 








5 








Glu 


Ser 


Pro 


Gin 


Cys 


Ser 


Ser 


Ser 










20 








Glu 


Lys 


Gin 


His 


Gly Asp 


Ala 


Gly 










35 








Ser 


Leu 


Pro 


Gly 


Gin 


Pro 


Leu 


Ala 










50 








Glu 


Ser 


Pro 


Asp 


Glu 


Ala 


Ala 


Leu 










65 








Gin 


Cys 


Thr 


Leu 


Arg 


Ser 


Arg 


Thr 










80 








Phe 


Ala 


Ala 


Leu 


Gly 


Pro 


Leu 


Thr 










95 








Pro 


Phe 


Asp 


Ser 


Val 


Arg 


Lys 


Arg 










110 








Pro 


Leu 


Ser 


Asn 


Gin 


Val 


Val 


Val 










125 








Val 


He 


Met 


Glu 


Leu 


Leu 


Ser 


Val 










140 








Leu 


Glu 


Lys 


Gin 


Gin 


Met 


He 


Val 










155 








Leu 


Asp Asp Tyr 


Ala 


Lys 


Gin 


Gly 










170 








Lys 


Lys 


Val 


Met 


Ser 


Asp 


Thr 


Glu 










185 








His 


Phe 


Leu 


Ala 


Glu 


Thr 


Ser 


He 










200 








Leu 


Glu 


Ser 


Ala 


Met 


Arg 


Leu 


Glu 










215 








Ala 


Thr 


Gly 


He 


Glu 


Asp 


Arg 


Leu 










230 








He 


Glu 


Ala 


Leu 


His 


Lys 


Ala 


Gly 










245 








Gly 


Asp 


Lys 


Gin 


Glu 


Thr 


Ala 


Val 










260 








Leu 


Leu 


Glu 


Pro 


Asp Asp 


Lys 


Leu 










275 








Lys 


Asp 


Ala 


Cys 


Gly Met 


Leu 


Met 










290 








Gin 


Lys 


Lys 


Thr 


Gin 


Ala 


Leu 


Pro 










305 








Asp 


Leu 


Leu 


Gin 


Pro 


Pro 


Val 


Pro 










320 








Gly 


Leu 


He 


He 


Thr 


Gly 


Lys 


Thr 










335 








Ser 


Leu 


Gin 


Lys 


Gin 


Phe 


Leu 


Glu 










350 








Val 


Val 


Cys 


Cys 


Arg 


Ala 


Thr 


Pro 










365 








Lys 


Leu 


Val 


Arg 


Ser 


His 


Leu 


Gin 










380 








Asp 


Gly Ala Asn Asp Val 


Ser 


Met 










395 








He 


Gly Val 


Ser 


Gly Gin 


Glu Gly 



Glu 


Glu 


Val 


Ser 


Gin 


Val 


Cys 




10 










15 


Ser 


Ala 


Cys 


Cys 


Thr 


Glu 


Thr 




25 










30 


Leu 


Leu 


Asn 


Gly 


Lys 


Ala 


Glu 




40 










45 


Cys 


Asn 


Leu 


Cys 


Tyr 


Glu 


Ala 




55 










60 


Val 


Tyr Ala 


Ala 


Arg 


Ala 


Tyr 




70 










75 


Pro 


Glu 


Gin 


Val 


Met 


Val 


Asp 




85 










90 


Phe 


Gin 


Leu 


Leu 


His 


He 


Leu 




100 










105 


Met 


Ser 


Val 


Val 


Val 


Arg 


His 




115 










120 


Tyr 


Thr 


Lys 


Gly 


Ala 


Asp 


Ser 




130 










135 


Ala 


Ser 


Pro 


Asp 


Gly 


Ala 


Ser 




145 










150 


Arg 


Glu 


Lys 


Thr 


Gin 


Lys 


His 




160 










165 


Leu 


Arg 


Thr 


Leu 


Cys 


He 


Ala 




175 










180 


Tyr 


Ala 


Glu 


Trp 


Leu 


Arg 


Asn 




190 










195 


Asp 


Asn Arg 


Glu 


Glu 


Leu 


Leu 




205 










210 


Asn 


Lys 


Leu 


Thr 


Leu 


Leu 


Gly 




220 










225 


Gin 


Glu 


Gly 


Val 


Pro 


Glu 


Ser 




235 










240 


He 


Lys 


He 


Trp 


Met 


Leu 


Thr 




250 










255 


Asn 


He 


Ala 


Tyr 


Ala 


Cys 


Lys 




265 










270 


Phe 


He 


Leu 


Asn 


Thr 


Gin 


Ser 




280 










285 


Ser 


Thr 


He 


Leu 


Lys 


Glu 


Leu 




295 










300 


Glu 


Gin 


Val 


Ser 


Leu 


Ser 


Glu 




310 










315 


Arg 


Asp 


Ser 


Gly 


Leu 


Arg 


Ala 




325 










330 


Leu 


Glu 


Phe 


Ala 


Leu 


Gin 


Glu 




340 










345 


Leu 


Thr 


Ser 


Trp 


Gly 


Gin 


Ala 




355 










360 


Leu 


Gin 


Lys 


Ser 


Glu 


Val 


Val 




370 










375 


Val 


Met 


Thr 


Leu 


Ala 


He 


Gly 




385 










390 


He 


Gin 


Val 


Ala 


Asp 


He 


Gly 




400 








405 


Met 


Gin 


Ala 


Val 


Met 


Ala 


Ser 
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410 415 420 

Asp Phe Ala Val Ser Gin Phe Lys His Leu Ser Lys Leu Leu Leu 
425 430 435 

Val His Gly His Trp Cys Tyr Thr Arg Leu Ser Asn Met lie Leu 
440 445 450 

Tyr Phe Phe Tyr Lys Asn Val Ala Tyr Val Asn Leu Leu Phe Trp 
455 460 465 

Tyr Gin Phe Phe Cys Gly Phe Ser Gly Thr Ser Met Thr Asp Tyr 

470 475 480 

Trp Val Leu He Phe Phe Asn Leu Leu Phe Thr Ser Ala Pro Pro 

485 490 495 

Val He Tyr Gly Val Leu Glu Lys Asp Val Ser Ala Glu Thr Leu 

500 505 510 

Met Gin Leu Pro Glu Leu Tyr Lys Ser Gly Gin Lys Ser Glu Ala 

515 520 ~* 525 

Tyr Leu Pro His Thr Phe Trp He Thr Leu Leu Asp Ala Phe Tyr 

530 535 540 

Gin Ser Leu Val Cys Phe Phe Val Pro Tyr Phe Thr Tyr Gin Gly 

545 550 555 

Ser Asp Thr Asp He Phe Ala Phe Gly Asn Pro Leu Asn Thr Ala 

560 565 570 

Ala Leu Phe He Val Leu Leu His Leu Val He Glu Ser Lys Ser 

575 580 585 

Leu Thr Trp He His Leu Leu Val He He Gly Ser He Leu Ser 

590 595 600 

Tyr Phe Leu Phe Ala lie Val Phe Gly Ala Met Cys Val Thr Cys 

605 610 " 615 

Asn Pro Pro Ser Asn Pro Tyr Trp He Met Gin Glu His Met Leu 

620 625 630 

Asp Pro Val Phe Tyr Leu Val Cys He Leu Thr Thr Ser lie Ala 

635 640 645 

Leu Leu Pro Arg Phe Val Tyr Arg Val Leu Gin Gly Ser Leu Phe 

650 655 660 

Pro Ser Pro He Leu Arg Ala Lys His Phe Asp Arg Leu Thr Pro 

665 670 ~ 675 

Glu Glu Arg Thr Lys Ala Leu Lys Lys Trp Arg Gly Ala Gly Lys 

680 685 690 

Met Asn Gin Val Thr Ser Lys Tyr Ala Asn Gin Ser Ala Gly Lys 

695 700 705 

Ser Gly Arg Arg Pro Met Pro Gly Pro Ser Ala Val Phe Ala Met 

710 715 720 

Lys Ser Ala Ser Ser Cys Ala lie Glu Gin Gly Asn Leu Ser Leu 

725 730 735 

Cys Glu Thr Ala Leu Asp Gin Gly Tyr Ser Glu Thr Lys Ala Phe 

740 745 750 

Glu Met Ala Gly Pro Ser Lys Gly Lys Glu Ser 

755 760 

<210> 22 

<211> 219 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 8543628CD1 

<400> 22 

Met Ala Ser Ser Gly Leu Glu Leu Leu Trp Val Ser Leu Pro Gin 

1 5 10 15 

Leu Gly Lys Gly Ala Ala Gin Thr Leu Ser lie Ser Phe Leu Ser 

20 25 30 

He Gly Phe Ser Thr Val Gly Gly Val Leu Tyr Gly Val Leu Arg 
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35 










40 






45 


Thr Leu 


Asn 


Asn 


Lys 


Ala 


He 


Asn 


Gly Val 


Leu Arg Val 


Tyr 


Leu 








50 










55 




60 


Glu Leu 


Phe 


Arg 


Ala 


He 


Pro 


Val 


Leu 


Val 


Trp Leu Tyr 


Leu 


Leu 








65 










70 




75 


Phe Phe Gly 


Val 


Pro 


He 


Phe 


Phe 


Gly Leu 


Ser He Pro 


Ser 


Phe 








80 










85 






90 


Trp Cys 


Ala 


Val 


Leu 
95 


Val 


Leu 


Ser 


Leu 


Trp 
100 


Gly Ala Ser 


Glu 


Val 
105 


Gly Glu 


Val 


Val 


Arg 


Gly Ala 


Leu 


His 


Ser 


Leu Pro Arg Gly Gin 








110 










115 






120 


Arg Glu 


Ala 


Gly Leu 


Ser 


He 


Gly Leu 


Ser 


Asp Leu Gin 


Leu 


Tyr 








125 










130 






135 


Gly Tyr Val 


Leu 


Leu 


Pro 


Gin 


Ala 


Leu 


Arg 


Arg Met Thr 


Pro 


Pro 


Thr lie 






140 










145 






150 


Asn 


Val 


Tyr 
155 


Thr 


Arg 


He 


He 


Lys 
160 


Thr Ser Ser 


Leu 


Ala 
165 


Val Leu 


He 


Gly Val 


Val 


Asp 


Val 


He 


Lys 


Val Gly Gin Gin 


He 








170 










175 






180 


He Glu 


Arg 


Thr 


Tyr 


Glu 


Ser 


Val 


Leu 


He Tyr Gly Ala 


Leu 


Phe 








185 










190 






195 


Leu Phe 


Phe 


Phe 


Phe 
200 


He 


Cys 


Tyr 


Pro 


Leu 
205 


Ser Ala Ala 


Ser 


Lys 
210 


Leu Leu 


Glu 


Arg 


Arg 
215 


Trp 


Ala 


Gin 


Ala 











<210> 23 
<211> 463 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc__f eature 

<223> Incyte ID No: 7482754CD1 

<400> 23 



Met 


Glu 


Gly 


Gin 


Thr 


Pro 


Gly 


Ser 


Arg 


Gly Leu 


Pro 


Glu 


Lys 


Pro 


1 








5 










10 








15 


His 


Pro 


Ala 


Thr 


Ala 


Ala 


Ala 


Thr 


Leu 


Ser 


Ser 


Met 


Gly 


Ala 


Val 


Phe 








20 










25 








30 


He 


Leu 


Met 


Lys 


Ser 


Ala 


Leu 


Gly 


Ala 


Gly Leu 


Leu 


Asn 


Phe 










35 










40 










45 


Pro 


Trp 


Ala 


Phe 


Ser 


Lys 


Ala 


Gly Gly Val 


Val 


Pro 


Ala 


Phe 


Leu 










50 










55 










60 


Val 


Glu 


Leu 


Val 


Ser 


Leu 


Val 


Phe 


Leu 


He 


Ser 


Gly Leu Val 


He 










65 










70 










75 


Leu 


Gly 


Tyr 


Ala 


Ala 


Ala 


Val 


Ser 


Gly 


Gin 


Ala 


Thr Tyr Gin Gly 










80 










85 










90 


Val 


Val 


Arg 


Gly Leu 


Cys 


Gly Pro 


Ala 


He Gly Lys 


Leu 


Cys 


Glu 


Ala 








95 










100 








105 


Cys 


Phe 


Leu 


Leu 


Asn 


Leu 


Leu 


Met 


He 


Ser 


Val 


Ala 


Phe 


Leu 










110 










115 










120 


Arg 


Val 


He 


Gly Asp Gin 


Leu 


Glu 


Lys 


Leu 


Cys 


Asp 


Ser 


Leu 


Leu 










125 










130 










135 


Ser 


Gly 


Thr 


Pro 


Pro 


Ala 


Pro 


Gin 


Pro 


Trp 


Tyr 


Ala 


Asp 


Gin 


Arg 


Phe 








140 










145 










150. 


Thr 


Leu 


Pro 


Leu 


Leu 


Ser 


Val 


Leu 


Val 


He 


Leu 


Pro 


Leu 


Ser 










155 










160 










165 


Ala 


Pro 


Arg 


Glu 


He 


Ala 


Phe 


Gin 


Lys 


Tyr 


Thr 


Ser 


Pro 


Ser 


His 


Gly 








170 










175 










180 


His 


Cys 


Val 


Ser 


He 


Leu 


Gly Thr 


Leu 


Ala 


Ala 


Cys 


Tyr 


Leu 










185 










190 






195 


Ala 


Leu 


Val 


He 


Thr 


Val 


Gin 


Tyr 


Tyr 


Leu 


Trp 


Pro 


Gin 


Gly Leu 
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200 205 210 

Val Arg Glu Ser His Pro Ser Leu Ser Pro Ala Ser Trp Thr Ser 

215 220 225 

Val Phe Ser Val Phe Pro Thr He Cys Phe Gly Phe Gin Cys His 

230 235 240 

Glu Ala Ala Val Ser He Tyr Cys Ser Met Arg Lys Arg Ser Leu 

245 250 255 

Ser His Trp Ala Leu Val Ser Val Leu Ser Leu Leu Ala Cys Cys 

260 265 270 

Leu He Tyr Ser Leu Thr Gly Val Tyr Gly Phe Leu Thr Phe Gly 

275 280 285 

Thr Glu Val Ser Ala Asp Val Leu Met Ser Tyr Pro Gly Asn Asp 

290 295 300 

Met Val He He Val Ala Arg Val Leu Phe Ala Val Ser He Val 

305 310 315 

Thr Val Tyr Pro He Val Leu Phe Leu Gly Arg Ser Val Met Gin 

320 325 330 

Asp Phe Trp Arg Arg Ser Cys Leu Gly Gly Trp Gly Pro Ser Ala 

335 340 345 

Leu Ala Asp Pro Ser Gly Leu Trp Val Arg Met Pro Leu Thr He 

350 355 360 

Leu Trp Val Thr Val Thr Leu Ala Met Ala Leu Phe Met Pro Asp 

365 370 375 

Leu Ser Glu He Val Ser He He Gly Gly He Ser Ser Phe Phe 

380 385 390 

• He Phe He Phe Pro Gly Pro Gly Gly Lys His Arg Phe Cys Glu 

395 400 405 

Gly Trp Phe Leu Arg Lys Pro Ser Gin Thr Ala Gin Ala Leu Ser 

410 415 420 

Glu Lys Gly Lys Pro Ala Arg Phe Val Pro His Leu Cys Asn Gly 

425 430 435 

Cys Arg Ala Tyr Arg Thr Lys Ser Gin Val Arg Asn Leu Arg Ala 

440 445 450 

Thr Gly Arg Glu Ser Leu Ser Thr Gin Tyr Glu Gly He 

455 460 

<210> 24 

<211> 1043 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3794818CD1 

<400> 24 

Met Glu Phe Val Arg Ala Leu Trp Leu Gly Leu Ala Leu Ala Leu 
1 5 10 15 

Gly Pro Gly Ser Ala Gly Gly His Pro Gin Pro Cys Gly Val Leu 

20 25 30 

Ala Arg Leu Gly Gly Ser Val Arg Leu Gly Ala Leu Leu Pro Arg 

35 40 45 

Ala Pro Leu Ala Arg Ala Arg Ala Arg Ala Ala Leu Ala Arg Ala 

50 55 60 

Ala Leu Ala Pro Arg Leu Pro His Asn Leu Ser Leu Glu Leu Val 

65 70 75 

Val Ala Ala Pro Pro Ala Arg Asp Pro Ala Ser Leu Thr Arg Gly 

80 85 ~ 90 

Leu Cys Gin Ala Leu Val Pro Pro Gly Val Ala Ala Leu Leu Ala 

95 100 105 

Phe Pro Glu Ala Arg Pro Glu Leu Leu Gin Leu His Phe Leu Ala 
110 115 1 120 

Ala Ala Thr Glu Thr Pro Val Leu Ser Leu Leu Arg Arg Glu Ala 
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Arg 


Ala 


Pro 


Trp 


Ala 


Ser 


Leu 


Gin 


Ala 


Thr 


Gin 


Asp 


Gly 


Arg 


Pro 


Gly 


Asp 


Ala 


Val 


Gly 


Gly 


Asp 


He 


Ala 


Pro 


His 


Trp 


Thr 


Ala 


Gly 


Arg 


Pro 


Pro 


Ala 


Arg 


Ala 


Leu 


Leu 


Pro 


Pro 


Glu 


Ser 


Ser 


Phe 


Gin 


Gin 


Val 


His 


Asp 


Pro 


Arg 


Asp 


Gly 


Gin 


Pro 


Pro 


Pro 


Thr 


Leu 


Leu 


Asp 


Gly 


Gin 


Asn 


Asp 


Ser 


Gly 


Ser 


Ala 


He 


Asp 


Leu 


Glu 


Leu 


Tyr 


Gly 


Arg 


Trp 


His 


Met 


Ala 


Val 


Val 


Asp 


Met 


Val 


Arg 


Trp 


Pro 


Leu 


His 


Leu 


Thr 





125 




Leu 


Gly 


Ala 




140 




Pro 


Leu 


Glu 




155 




His 


Ala 


Trp 




170 




Pro 


Gly 


Gly 




185 




Pro 


Gin 


Leu 




200 




Gly 


Leu 


Arg 




215 




Glu 


Ala 


Pro 




230 




Arg 


Ala 


Arg 




245 




Leu 


Leu 


Gly 




260 




Leu 


Pro 


Pro 




275 




Leu 


Glu 


Ala 




290 




Leu 


Gly 


Ser 




305 




Ala 


Pro 


Val 




320 




Pro 


Gly 


Arg 




335 




Gly 


Arg 


Thr 




350 




Met 


Ser 


Arg 




365 




Gly 


Ala 


Pro 




380 




Leu 


Asp 


Leu 




395 




Gin 


Gly 


Ala 




410 




Glu 


His 


Pro 




425 




Cys 


Pro 


Ala 




440 




Ala 


Thr 


Leu 




455 




Pro 


Arg 


Ala 




470 




Leu 


Glu 


Arg 




485 




Leu 


Val 


Gly 




500 




Thr 


Gly 


Leu 




515 




Val 


Thr 


Ser 




530 




Phe 


Thr 


Ser 




545 




Ala 


Arg 


Asp 




560 




His 


Trp 


Ser 




575 




Ala 


Leu 


Phe 




590 





Pro 


Asn 


Pro 


Thr 


Leu 


Leu 


Glu 


Asp 


Val 


Leu 


Val 


Ala 


Val 


Leu 


Asp 


Ala 


Arg 


Leu 


Val 


Pro 


Ala 


Arg 


Val 


Leu 


Thr 


Pro 


Leu 


Gly 


Leu 


Leu 


Ala 


He 


His 


Ala 


Ala 


Gin 


Asn 


Cys 


Gly 


Phe 


Leu 


Ala 


Gly 


Pro 


Val 


His 


Phe 


Lys 


Ala 


Trp 


Ala 


Glu 


Pro 


Gly 


Gin 


Val 


Trp 


Phe 


Val 


Phe 


Gly 


Gin 


Leu 


Asp 


Ala 


Leu 


Leu 


Arg 


Lys 


Leu 


Ala 


Glu 


Asp 


Gly 


Lys 


Val 


Gly 


Asp 


Phe 


Ser 


He 


Pro 


Phe 


Phe 


Thr 


Ala 


Ser 


Thr 


Trp 


Leu 


Leu 


Thr 


Val 



130 






Phe 


His 


Leu 


145 






Asp 


Val 


Leu 


160 






Glv 


Leu 


Ala 


175 






Leu 




Thr 


190 






Leu 


Ser 


Arq 


205 






Ala 


Pro 


Met 


220 






Ala 


Val 


Leu 


235 






Glu 


Ala 


Val 


250 






Pro 


Pro 


Lys 


265 






Ala 


Leu 


Glv 


280 








He 


Val 


295 






Val 


Gin 


Pro 


310 






Asp 


Leu 


Gin 


325 






Arc 


Phe 


Leu 


340 






Trp 


Val 


Thr 


355 






Val 


Trp 


Ser 


370 






Thr 


Val 


Gly 


385 






Gly 


Ala 


Ser 


400 






Pro 


Lys 


Leu 


415 






Ala 


Arg 


Asp 


430 






Cvs 


Leu 


ASD 


445 






Phe 


Ala 


Ala 


460 






Cvs 


Cys 


Tvr 


475 






Asp 


Thr 


Pro 


490 






Tyr 


Gly 


Ala 


505 






Leu 


Leu 


Ala 


520 






Asn 


Ser 


Ala 


535 






Ser 


Thr 


Ser 


550 






Pro 


He 


Gly 


565 






Gly 


Val 


Phe 


580 






Tyr 


Glu 


Trp 


595 







38/83 



135 

Gin Leu His 
150 

Val Ala Val 
165 

Leu Cys Arg 
180 

Ser Arg Ala 
195 

Arg Asp Thr 
210 

Ala Ala Pro 
225 

Leu Gly Cys 
240 

Pro Pro Gly 
255 

Ala Leu Pro 
270 

Glu Val Ala 
285 

Gin Leu Val 
300 

Lys Arg Ala 
315 

Pro Ala Gly 
330 

Ala Asn Thr 
345 

Gly Ser Ser 
360 

Leu Arg Arg 
375 

Ser Trp Arg 
390 

Ala Arg Pro 
405 

Arg Val Val 
420 

Pro Asp Glu 
435 

Pro Gly Thr 
450 

Leu Ala Asn 
465 

Gly Tyr Cys 
480 

Phe Asp Phe 
495 

Leu Arg Asp 
510 

Gly Arg Ala 
525 

Arg Ser Gin 
540 

Leu Gly He 
555 

Ala Phe Met 
570 

Ala Ala Leu 
585 

Arg Ser Pro 
600 
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Tyr Gly Leu 


Thr Pro Arg Gly Arg 










605 




Tyr 


Ser 


Ser 


Ala 


Leu 


Asn Leu Cys 










620 




Thr 


Val 


Ser 


Ser 


Lys 


Thr Pro Lys 










635 




Met 


Asn 


Leu 


Trp 


Ala 


He Phe Cys 










650 




Thr 


Ala 


Asn 


Leu 


Ala 


Ala Val Met 










665 




Glu 


Leu 


Ser 


Gly He His Asp Pro 










680 




Gly Phe Arg 


Phe Gly Thr Val Trp 










695 




lie 


Lys 


Lys 


Ser 


Phe 


Pro Asp Met 










710 




Ser 


Ala 


Pro 


Thr 


Thr 


Pro Arg Gly 










725 


Pro 


Pro 


Lys 


Leu 


Asn 


Ala Phe He 










740 




Tyr 


Glu 


Val 


Ser 


He 


Asp Ala Asp 










755 




Lys 


Pro 


Phe 


Ala 


He 


Glu Gly Tyr 










770 




Ser 


Pro 


Leu 


Thr 


Ser 


Asn Lou Ser 










785 




Ser 


Ser Gly 


Phe 


He 


Asp Leu Leu 










800 




Val 


Pro Cys 


Gly 


Lys 


Arg Val Phe 










815 




Met 


Ser 


He 


Tyr 


His 


Phe Ala Gly 










830 




Gly Leu Gly 


Ser 


Ala 


Leu Leu Ser 










845 




Phe Arg Leu 


Ala 


Leu 


Pro Arg He 










860 




Tyr 


Trp 


Leu 


His 


Thr 


Ser Gin Lys 










875 




Glu 


Pro 


Pro 


Glu Gly Ser Lys Glu 










890 




Ser Gly 


Pro 


Glu 


Val 


Glu Gin Gin 










905 




Thr 


Ala 


Pro 


Glu Gly Trp Lys Arg 










920 




Glu 


Arg 


Arg 


Val 


Arg 


Phe Leu Leu 










935 




Pro 


Glu 


Ala 


Asp 


Ala 


Glu Ala Glu 










950 




Val 


Trp 


Leu 


Cys Ser Tyr Gly Arg 










965 




Gly Ala 


Pro 


Gin Pro Gly Glu Leu 










980 




Glu 


Val 


Ala 


Arg 


Glu 


Arg Leu Arg 










995 




Gin 


Leu 


Leu 


Ala Gin Leu Gly Asp 








1010 




Arg 


Leu 


Leu 


Gin 


Ala 


Arg Ala Ala 








1025 




Ser 


Gly Arg 


Pro Gly 


Ser Gin Glu 








1040 
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Asn Arg Ser Thr 


Val 


Phe 


Ser 


610 






615 


Tyr Ala He Leu 


Phe 


Arg 


Arg 


625 






630 


Cys Pro Thr Gly Arg 


Leu 


Leu 


640 






645 


Leu Leu Val Leu 


Ser 


Ser 


Tyr 


655 






660 


Val Gly Asp Lys 


Thr 


Phe 


Glu 


670 






675 


Lys Leu His His 


Pro 


Ala 


Gin 


685 






690 


Glu Ser Ser Ala 


Glu 


Ala 


Tyr 


700 






705 


His Ala His Met 


Arg 


Arg 


His 


715 






720 


Val Ala Met Leu 


Thr 


Ser 


Asp 


730 






735 


Met Asp Lys Ser 


Leu 


Leu 


Asd 


745 






750 


Cys Lys Leu Leu 


Thr 


Val 


Gly 


760 






765 


Gly He Gly Leu 


Pro 


Gin 


Asn 


775 






780 


Glu Phe He Ser Arg 


Tyr 


Lys 


790 






795 


His Asp Lys Trp 


Tyr 


Lys 


Met 


805 






810 


Ala Val Thr Glu 


Thr 


Leu 


Gin 


820 






825 


Leu Phe Val Leu 


Leu 


Cys 


Leu 


835 






840 


Ser Leu Gly Glu His 


Ala 


Phe 


850 






855 


Arg Lys Gly Ser Arg 


Leu 


Gin 


865 






870 


He His Arg Ala Leu 


Asn 


Thr 


880 






885 


Glu Thr Ala Glu 


Ala 


Glu 


Pro 


895 






900 


Gin Gin Gin Gin 


Asp 


Gin 


Pro 


910 






915 


Ala Arg Arg Ala Val 


Asp 


Lys 


925 






930 


Glu Pro Ala Val 


Val 


Val 


Ala 


940 






945 


Ala Ala Pro Arg 


Glu 


Gly 


Pro 


955 






960 


Pro Pro Ala Ala Arg 


Pro 


Thr 


970 






975 


Gin Glu Leu Glu Arg 


Arg 


He 


985 






990 


Gin Ala Leu Val 


Arg 


Arg 


Gly 


1000 




1005 


Ser Ala Arg His Arg 


Pro 


Arg 


1015 




1020 


Pro Ala Glu Ala 


Pro 


Pro 


His 


1030 




1035 
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Met 


Arg 


Gly 


Ser 


Pro 


Gly 


Asp 


Ala 


Glu 


Arg 


Arg 


Gin 


Arg 


Trp 


Gly 


1 








5 










i o 










1 5 


Arg^ Leu 


Phe 


Glu 


Glu 


Leu 


Asp 


Ser 


Asn 


Lys 


Asp 


Gly 


Arg 


Vai 


Asp 










20 










9 5 












vai 


HIS 


Glu 


Leu 


Arg 
35 


Gin 


Gly 


Leu 


Ala 


Arg 
40 


Leu 


Gly 


Gly 


Gly 


Asn 

A R 


Pro 


ASp 


Pro 


Gly 


Ala 
50 


Gin 


Gin 


Gly 


He 


Ser 

5 5 


Ser 


Glu 


Gly 


Asp 


Ala 

D u 


Asp 


Pro 


Asp 


Gly 


Gly 
65 


Leu 


Asp 


Leu 


Glu 


Glu 
n o 


Phe 


Ser 


Arg 


Tyr 


Leu 

/ D 


bin 


G1U 


Arg 


Glu 


Gin 
80 


Arg 


Leu 


Leu 


Leu 


Met 

ft 5 
O D 


Phe 


His 


Ser 


Leu 


Asp 
o n 


Arg 


Asn 


Gin 


Asp 


Gly 


His 


He 


Asp 


Vai 


Ser 
i no 

JL U U 


Glu 


He 


Gin 


Gin 


Ser 


rtie 


Arg 


Ala 


Leu 


Gly 
110 


He 


Ser 


He 


Ser 


Leu 

115 

.L _L J 


Glu 


Gin 


Ala 


Glu 


Lys 

1 9 C\ 


lie 


Leu 


His 


Ser 


Met 
125 


Asp 


Arg 


Asp 


Gly 


Thr 
i ^o 

1 J u 


Met 


Thr 


He 


Asp 


Trp 

1 ^ 5 


Gin 


Glu 


Trp 


Arg 


Asp 
140 


His 


Phe 


Leu 


Leu 


His 

14 5 
J. f± D 


Ser 


Leu 


Glu 


Asn 


Vai 

i cn 
1DU 


Glu 


ASp 


Vai 


Leu 


Tyr 

1 5 5 
j. j j 


Phe 


Trp 


Lys 


His 


Ser 

1 AO 


Thr 


Vai 


Leu 


Asp 


He 
lo j 


Giy 


blU 


Cys 


Leu 


Thr 
1 70 


Vai 


Pro 


Asp 


Glu 


Phe 

1 75 


Ser 


Lys 


Gin 


Glu 


Lys 

1 OA 
1 O U 


Leu 


i nr 


Gly 


Met 


Trp 
185 


Trp 


Lys 


Gin 


Leu 


Vai 
i on 


Ala 


Gly 


Ala 


Vai 


Ala 


Giy 


7V. 1 

Ala 


Vai 


Ser 


Arg 
200 


Thr 


Gly 


Thr 


Ala 


Pro 

9 05 


Leu 


Asp 


Arg 


Leu 


Lys 

91 0 


Vai 


Jrne 


Met 


Gin 


Vai 
215 


His 


Ala 


Ser 


Lys 


Thr 
990 


Asn 


Arg 


Leu 


Asn 


He 

9 9 5 


Leu 


\jiy 


Gly 


Leu 


Arg 
230 


Ser 


Met 


Vai 


Leu 


Glu 
9^5 


Gly 


Gly 


He 


Arg 


Ser 

9 A O 


Leu 


Trp 


Arg 


Gly 


Asn 
245 


Gly 


He 


Asn 


Vai 


Leu 

950 

£m J \J 


Lys 


He 


Ala 


Pro 


Glu 
95 5 


Sex* 


nia 


lie 


Lys 


Phe 
260 


Met 


Ala 


Tyr 


Glu 


Gin 
265 


He 


Lys 


Arg 


Ala 


He 
270 


Leu 


o j.y 


Gin 


Gin 


Glu 
275 


Thr 


Leu 


His 


Vai 


Gin 
280 


Glu 


Arg 


Phe 


Vai 


Ala 

9ft 5 

4& O J 


Gly 


Ser 


Leu 


Ala 


Gly 
290 


Ala 


Thr 


Ala 


Gin 


Thr 
295 


He 


He 


Tyr 


Pro 


Met 
300 


Glu 


Thr 


Leu 


Lys 


Asn 
305 


Trp 


Trp 


Leu 


Gin 


Gin 
310 


Tyr 


Ser 


His 


Asp 


Ser 
315 


Ala 


Asp 


Pro 


Gly 


lie 
320 


Leu 


Vai 


Leu 


Leu 


Ala 
325 


Cys 


Gly 


Thr 


He 


Ser 
330 


Ser 


Thr 


Cys 


Gly 


Gin 
335 


He 


Ala 


Ser 


Tyr 


Pro 
340 


Leu 


Ala 


Leu 


Vai 


Arg 
345 


Thr 


Arg 


Met 


Gin 


Ala 
350 


Gin 


Ala 


Ser 


He 


Glu 
355 


Gly 


Gly 


Pro 


Gin 


Leu 
360 


Ser 


Met 


Leu 


Gly 


Leu 
365 


Leu 


Arg 


His 


He 


Leu 
370 


Ser 


Gin 


Glu 


Gly 


Met 
375 


Arg 


Gly 


Leu 


Tyr 


Arg 
380 


Gly 


He 


Ala 


Pro 


Asn 
385 


Phe 


Met 


Lys 


Vai 


He 
390 


Pro 


Ala 


Vai 


Ser 


He 
395 


Ser 


Tyr 


Vai 


Vai 


Tyr 
400 


Glu 


Asn 


Met 


Lys 


Gin 
405 


Ala 


Leu 


Gly 


Vai 


Thr 


Ser 


Arg 


Leu 


Glu 


Tyr 


Ser 


Gly 


Ser 


He 


Ser 



40/83 



1NS0OCI0: <WO 024641 SA2J A> 



410 



Asp His Cys Asn 


Leu 


Cys 


Leu 


Pro 




425 








Ser Ala Ser Arg Val 


Ala Gly He 




440 








Gin Ala His Leu Gly 


Leu Val Gly 




455 








Ser Leu Pro Thr 


Cys 


Trp Asp 


Tyr 




470 
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Met Ala Gly Leu 


Arg 


Asn 


Glu 


Ser 


1 


5 








Asp Thr Pro Gly 


Ser 


Arg 


Glu 


Trp 




20 








His Tyr Lys Ser 


Arg 


Trp Arg 


Ser 




35 








Met Phe Leu Ser 


Ser 


Val Gly 


Pne 




50 








Trp Pro Tyr Leu 


Gin 


Lys 


He 


ASp 




65 








Leu Gly Trp Val 


He 


Ala 


Ser 


Tyr 




80 








Ser Pro lie Phe 


Gly 


Leu 


Trp 


Ser 




95 








Pro Leu lie Val 


Ser 


He 


Leu 


lie 




110 








Tyr Ala Tyr Leu 


His 


He 


Pro 


Ala 




125 








Leu Val Ala Arg 


Gly 


Leu 


Leu 


Gly 




140 








Val Val Arg Ser 


Tyr 


Thr 


Ala 


Gly 




155 








Thr Ser Ser Met 


Ala 


Asn 


He 


Ser 




170 








He Leu Gly Pro 


Val 


Phe 


Gin 


Thr 




185 








Lys Gly Val Thr 


Trp 


Asp Val 


lie 




200 








Thr Thr Pro Val 


Leu 


Leu 


Ser 


Ala 




215 








He Leu He Leu 


Ala 


lie 


Leu 


Arg 




230 








Gly Arg Gin Cys 


Lys 


Ser 


lie 


Asn 




245 








Glu Ala Gin Val 


Pro 


Gin Gly 


Asn 




260 








Ala He Asn Val 


Leu 


Phe 


Phe 


Val 




275 








Phe Glu Thr He 


He 


Thr 


Pro 


Leu 




290 








Thr Gin Glu Gin 


Ala 


Val 


Leu 


Tyr 




305 









415 










420 


Gly Ser 


Ser 


Asp 


Ser 


Pro 


Ala 


430 










435 


Thr Gly 


Phe 


His 


His 


Val 


Ala 


445 










450 


Ser Arg 


Asn 


Ser 


Ala 


Ala 


Phe 


460 










465 


Arg Lys 


Pro 


Val 


Val 


Met 


Pro 


475 










480 



Glu 


Gin 


Glu Pro Leu Leu Gly 




10 


15 


Asp 


lie 


Leu Glu Thr Glu Glu 




25 


30 


lie Arg 


He Leu Tyr Leu Thr 




40 


45 


Ser 


Val 


Val Met Met Ser He 




55 


60 


Pro 


Thr 


Ala Asp Thr Ser Phe 




70 


75 


Ser 


Leu 


Gly Gin Met Val Ala 




85 


90 


Asn 


Tyr 


Arg Pro Arg Lys Glu 




100 


105 


Ser 


Val 


Ala Ala Asn Cys Leu 




115 


120 


Ser 


His 


Asn Lys Tyr Tyr Met 




130 


135 


lie Gly 


Ala Gly Asn Val Ala 




145 


150 


Ala 


Thr 


Ser Leu Gin Glu Arg 




160 


165 


Met 


Cys 


Gin Ala Leu Gly Phe 




175 


180 


Cys 


Phe 


Thr Phe Leu Gly Glu 




190 


195 


Lys 


Leu 


Gin lie Asn Met Tyr 




205 


210 


Phe 


Leu 


Gly lie Leu Asn lie 




220 


225 


Glu 


His 


Arg Val Asp Asp Ser 




235 


240 


Phe 


Glu 


Glu Ala Ser Thr Asp 




250 


255 


lie 


Asp 


Gin Val Ala Val Val 




265 


270 


Thr 


Leu 


Phe lie Phe Ala Leu 




280 


285 


Thr 


Met 


Asp Met Tyr Ala Trp 




295 


300 


Asn Gly 


lie He Leu Ala Ala 




310 


315 
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Leu 


Gly Val 


Glu 


Ala 


Val 


Val 


He 


Phe Leu Gly Val 


Lys Leu 


Leu 










320 








325 




330 


Ser 


Lys 


Lys 


He 


Gly 


Glu 


Arg 


Ala 


He Leu Leu Gly 


Gly Leu 


He 










335 








340 




345 


Val 


Val 


Trp Val 


Gly 


Phe 


Phe 


He 


Leu Leu Pro Trp 


Gly Asn 


Gin 










350 








355 




360 


Phe 


Pro 


Lys 


He 


Gin 


Trp 


Glu 


Asp 


Leu His Asn Asn 


Ser He 


Pro 










365 








370 




375 


Asn 


Thr 


Thr 


Phe 


Gly Glu 


He 


He 


He Gly Leu Trp 


Lys Ser 


Pro 










380 








385 




390 


Met 


Glu 


Asp Asp 
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Arg 


Pro Thr Gly Cys Ser 


He Glu 


Gin 
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400 




405 


Ala 


Trp 


Cys 


Leu 


Tyr 


i nr 


Pro 


Val 


He His Leu Ala 


Gin Phe 


Leu 










410 








415 




420 


Thr 


Ser 


Ala 


Val 


Leu 


lie 


Gly 


Leu Gly Tyr Pro Val 


Cys Asn 


Leu 










425 








430 




435 


Met 


Ser 


Tyr 


Thr 


Leu 


Tyr 


Ser 


Lys 


He Leu Gly Pro 


Lys Pro 


Gin 










440 








445 




450 


Gly 


Val 


Tyr 


Met 


Gly 


Trp 


Leu 


Thr 


Ala Ser Gly Ser 


Gly Ala 


Arq 










455 








460 




465 


He 


Leu 


Gly 


Pro 


Met 


Phe 


He 


Ser 


Gin Val Tyr Ala 


His Trp 


Glv 










470 








475 




480 


Pro 


Arg 


Trp 


Ala 


Phe 


Ser 


Leu 


Val 


Cys Gly He He 


Val Leu 


Thr 










465 








490 




495 


He 


Thr 


Leu 


Leu 


Gly 


Val 


Val 


Tyr 


Lys Arg Leu He 


Ala Leu 


Ser 










500 








505 




510 


Val 


Arg Tyr Gly 


Arg 


He 


Gin 


Glu 
















515 
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Met 


Arg 


Ser Ser 


Leu 


Ala 


Pro Gly Val 


Trp Phe Phe 


Arg Ala 


Phe 


1 






5 










10 




15 


Ser 


Arg 


Asp Ser 


Trp 


Phe 


Arg 


Gly 


Leu 


He Leu Leu 


Leu Thr 


Phe 








20 










25 




30 


Leu 


He 


Tyr Ala 


Cys 


Tyr 


His 


Met 


Ser 


Arg Lys Pro 


He Ser 


He 








35 










40 




45 


Val 


Lys 


Ser Arg 


Leu 


His 


Gin 


Asn 


Cys 


Ser Glu Gin 


He Lys 


Pro 








50 










55 




60 


He 


Asn 


Asp Thr 


His 


Ser 


Leu 


Asn 


Asp 


Thr Met Trp 


Cys Ser 


Trp 








65 










70 




75 


Ala 


Pro 


Phe Asp 


Lys 


Asp Asn 


Tyr 


Lys 


Glu Leu Leu 


Gly Gly Val 








80 










85 




90 


Asp 


Asn 


Ala Phe 


Leu 


He 


Ala 


Tyr 


Ala 


He Gly Met 


Phe He 


Ser 








95 










100 




105 


Gly 


Val 


Phe Gly 


Glu 


Arg 


Leu 


Pro 


Leu 


Arg Tyr Tyr 


Leu Ser 


Ala 








110 










115 




120 


Gly 


Met 


Leu Leu 


Ser 


Gly Leu 


Phe 


Thr 


Ser Leu Phe 


Gly Leu Gly 








125 










130 




135 


Tyr 


Phe 


Trp Asn 


He 


His 


Glu 


Leu 


Trp 


Tyr Phe Val 


Val He 


Gin 








140 










145 




150 


Val 


Cys 


Asn Gly 


Leu 


Val 


Gin 


Thr 


Thr Gly Trp Pro 


Ser Val 


Val 








155 










160 




165 


Thr 


Cys 


Val Gly 


Asn 


Trp 


Phe 


Gly 


Lys 


Gly Lys Arg 


Gly Phe 


He 








170 










175 




180 
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Met Gly lie Trp 


Asn 


Ser 


His 


Thr Ser Val Gly Asn He Leu 


Gly 




185 






190 


195 


Ser Leu lie Ala 


Gly 


He 


Trp 


Val Asn Gly Gin Trp Gly Leu 


Ser 




200 






205 


210 


Phe lie Val Pro 


Gly He 


lie 


Thr Ala Val Met Gly Val He 


Thr 




215 






220 


225 


Phe Leu Phe Leu 


He 


Glu 


His 


Pro Glu Asp Val Asp Cys Ala 


Pro 




230 






235 


240 


Pro Gin His His 


Gly Glu 


Pro 


Ala Glu Asn Gin Asp Asn Pro 


Glu 




245 






250 


255 


Asp Pro Gly Asn 


Ser 


Pro 


Cys 


Ser lie Arg Glu Ser Gly Leu 


Glu 




260 






265 


270 


Thr Val Ala Lys 


Cys 


Ser 


Lys 


Gly Pro Cys Glu Glu Pro Ala 


Ala 




275 






280 


285 


lie Ser Phe Phe 


Gly Ala 


Leu 


Arg lie Pro Gly Val Val Glu 


Phe 




290 






295 


300 


Ser Leu Cys Leu 


Leu 


Phe 


Ala 


Lys Leu Val Ser Tyr Thr Phe 


Leu 




305 






310 


315 


Tyr Trp Leu Pro 


Leu 


Tyr 


lie 


Ala Asn Val Ala His Phe Ser 


Ala 




320 






325 


330 


Lys Glu Ala Gly 


Asp 


Leu 


Ser 


Thr Leu Phe Asp Val Gly Gly 


He 




335 






340 


345 


He Gly Gly He 


Val 


Ala 


Gly 


Leu Val Ser Asp Tyr Thr Asn 


Gly 




350 






355 


360 


Arg Ala Thr Thr 


Cys 


Cys 


Val 


Met Leu lie Leu Ala Ala Pro 


Met 




365 






370 


375 


Met Phe Leu Tyr 


Asn 


Tyr 


He 


Gly Gin Asp Gly He Ala Ser 


Ser 




380 






385 


390 


He Val Met Leu 


He 


He 


Cys 


Gly Gly Leu Val Asn Gly Pro 


Tvr 




395 






400 


405 


Ala Leu He Thr 


Thr 


Ala 


Val 


Ser Ala Asp Leu Gly Thr His 


Lvs 




410 






415 


420 


Ser Leu Lys Gly 


Asn 


Ala 


Lys 


Ala Leu Ser Thr Val Thr Ala 


He 


425 






430 


435 
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He 


Asn 


Gly Gin Leu 


Ala 


Thr 






680 






Asn 


Val 


Thr Phe Ala 


Pro 


Gly 






695 






Glu 


Val 


Leu Ala Asp Asp Val 






710 






Val 


Gin 


Leu Thr Asp 


Ala 


Ser 






725 






Arg 


He 


Ala Asn He 


He 


He 






740 






Thr 


Val 


Ala Phe Ala 


Gin 


Met 






755 






Glu 


Arg 


Ser Ser Cys 


Ala 


Asn 






770 






His 


Phe 


Gly Arg Leu 


Leu 


Leu 






785 






Val 


Val 


Ala Leu Ala 


Met 


Glu 






800 






Tyr 


Glu 


Ser Pro lie 


Gin 


Gly 






815 






Trp 


Met 


Asn Val Ser 


Ala 


Val 






830 






Thr 


Leu 


Cys Leu Lys 


Glu 


Gin 






845 






Ser 


Ala 


Ser Glu Gly 


Pro 


Gin 






860 






Ser 


Pro 


Ala Val Asn 


Asn 


Ser 






875 






Met 


Thr 


Arg Val Ala 


Ser 


Leu 






890 






Ser 


Asp 


Tyr Glu Pro 


Val 


Thr 






905 






Gly 


Asp 


Glu Phe Ala 


Asn 


Leu 






920 






Phe 


Pro 


Glu Met Asp 


Glu 


Ser 






935 






His 


Leu 


Met Asn He 


Ser 


Ala 






950 






Gly 


Gin 


Pro Asn He 


Ser 


Thr 






965 






Ala 


Phe 


Gly Val Phe 


Val 


He 






980 






Glu Asp Gly Leu Phe 


Val 


Glu 






995 






Val 


Glu 


Leu Met He 


His 


Arg 






1010 






Ala 


Val 


Glu Trp Arg 


Val 


Val 






1025 






Asp 


Phe 


He Gly Ala Gly Glu 






1040 






Thr 


Lys 


Lys Thr Val 


He 


Leu 






1055 






Glu 


Asp 


Asp Glu Ser 


He 


He 






1070 






Gly 


Ser 


Arg He Leu 


Pro 


Ser 






1085 






Leu 


Ala 


Asn Asp Asn 


Val 


Ala 



Gin 


He 


Thr 


Lys 


Leu 


He 


Val Glu 






640 








645 


Lys 


Val 


Asn 


Leu 


Pro 


He 


He Arg 






655 








660 


Val 


Thr 


Val 


Gin 


Trp 


Val 


Ala Thr 






670 








675 


Gly Asp Leu Arg Val 


Val 


Ser Gly 






685 








690 


Glu 


Thr 


He 


Gin 


Thr 


Leu 


Leu Leu 






700 








705 


Pro 


Glu 


He 


Glu 


Glu 


Val 


He Gin 






715 








720 


Gly Gly Gly Thr 


He Gly Leu Asp 






730 








735 


Pro 


Ala 


Asn 


Asp Asp 


Pro 


Tyr Gly 






745 








750 


Val 


Tyr Arg 


Val 


Gin 


Glu 


Pro Leu 






760 








765 


He 


Thr 


Val 


Arg 


Arg 


Ser Gly Gly 






775 








780 


Phe 


Tyr 


Ser 


Thr 


Ser 


Asp 


He Asp 






790 








795 


Glu 


Gly 


Gin 


Asp 


Leu 


Leu 


Ser Tyr 






805 








810 


Val 


Pro 


Asp 


Pro 


Leu 


Trp Arg Thr 






820 








825 


Gly 


Glu 


Pro 


Leu 


Tyr 


Thr 


Cys Ala 






835 








840 


Ala 


Cys 


Ser 


Ala 


Phe 


Ser 


Phe Phe 






850 








855 


Cys 


Phe 


Trp 


Met 


Thr 


Ser 


Trp He 






865 








87 0 


Asp 


Phe 


Trp 


Thr 


Tyr 


Arg 


Lys Asn 






880 








885 


Phe 


Ser 


Gly Gin Ala 


Val 


Ala Gly 






895 








900 


Arg 


Gin 


Trp 


Ala 


He 


Met 


Gin Glu 






910 








915 


Thr 


Val 


Ser 


He 


Leu 


Pro 


Asp Asp 






925 








930 


Phe 


Leu 


He 


Ser 


Leu 


Leu 


Glu Val 






940 








945 


Ser 


Leu- Lys 


Asn 


Gin 


Pro 


Thr He 






955 








960 


Val 


Val 


He 


Ala 


Leu Asn Gly Asp 






970 








975 


Tyr 


Asn 


He 


Ser 


Pro 


Asn 


Thr Ser 






985 








990 


Val 


Gin 


Glu 


Gin 


Pro 


Gin 


Thr Leu 




1000 








1005 


Thr 


Gly Gly 


Ser 


Leu 


Gly 


Gin Val 




1015 








1020 


Gly Gly Thr Ala 


Thr 


Glu 


Gly Leu 




1030 








1035 


He 


Leu 


Thr 


Phe 


Ala 


Glu Gly Glu 




1045 








1050 


Thr 


He 


Leu 


Asp 


Asp 


Ser 


Glu Pro 




1060 








1065 


Val 


Ser 


Leu 


Val 


Tyr 


Thr 


Glu Gly 




1075 








1080 


Ser 


Asp Thr 


Val 


Arg 


Val 


Asn He 




1090 








1095 


Gly 


He 


Val 


Ser 


Phe 


Gin 


Thr Ala 
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1100 


Ser 


Arg 


Ser 


Val He 








1115 


val 


He 


Arg 


Thr Phe 








1130 


Lys 


He 


He 


Gly Gin 








1145 


Gly 


Gin 


Leu 


Phe Phe 






1160 


Val 


His 


Leu 


Leu Asp 








1175 


Gin 


Val 


He 


Leu Tyr 








1190 


Gly 


He 


Ala 


Leu Leu 








1205 


Val 


Glu 


Ala 


Ser Asp 








1220 


Ser 


Ser 


Arg 


Phe Val 








1235 


Leu 


Phe 


He 


Asn Arg 








1250 


Thr 


Tyr 


Thr 


Thr Val 








1265 


Val 


Gly 


Asn 


Leu Ala 








1280 


Gly 


Phe 


Leu 


He Leu 








1295 


He 


Thr 


He 


Leu Glu 








1310 


Leu 


Val 


Asn 


Leu Thr 








1325 


Ser 


Phe 


Pro 


Pro Arg 








1340 


He 


He 


Asp 


Ala Asn 








1355 


Gin 


Ser 


Arcr 


Phe Glu 








1370 


Val 


Ala 


Gin 


Arg Ser 








1385 


Val 


Tvr 


Ala 


Gin Asn 








1400 


Phe 


Thr 


Pro 


Met He 






1415 


Asn 


Val 


Asn 


He Met 








1430 


Glu 


Lys 


Phe 


Gin Leu 








1445 


Leu 


Gly 


Lys 


Asn Thr 








1460 


Asp 


Gly 


Pro 


Gly Val 








1475 


Leu 


Arg 


Glu^ 


Pro Thr 








1490 


Leu 


Tvr 


He 


Val Arg 








1505 


Thr 


Val 


Gin 


Phe He 








1520 


Lys 


Asp 


Leu 


Thr Pro 








1535 


Val 


Arg 


Phe 


Lys Ala 








1550 


Pro 


Glu 


Met 


Asp Glu 








1565 









1105 


Gly 


His 


Glu 


Gly Glu 








1120 


Pro 


Gly 


Arg 


Gly Asn 








1135 


Asn 


Gin 


Glu 


Leu Asn 








1150 


Pro 


Glu 


Gly 


Ser Leu 








1165 


Asp 


Asn 


He 


Pro Glu 








1180 


Asp 


Val 


Arg 


Thr Gin 








1195 


Asp 


Ala 


Gin 


Gly Tyr 








1210 


Glu 


Pro 


His 


Gly Val 








1225 


Leu 


Leu 


Gin 


Glu Ala 








1240 


Glu 


Phe 


Gly 


Ser Leu 








1255 


Pro 


Gly 


Met 


Leu Ser 








1270 


Glu 


Pro 


Glu 


Val Asp 








1285 


Glu 


Glu 


Gly 


Glu Thr 








1300 


Asp 


Asp 


Val 


Pro Glu 








1315 


Tyr 


Val 


Gly 


Leu Thr 








1330 


Leu 


Asp 


Ser 


Glu Gly 








1345 


Asp 


Glv 


Ala 


Ara Glv 








1360 


Val 


Asn 


Glu 


Thr His 








1375 


Ara 


Glu 


Pro 


Leu Gly 








1390 


Leu 


Glu 


Ala 


Gin Val 








1405 


Leu 


His 


Phe 


Ala Asp 








1420 


He 


Leu 


Asp 


Asp Asp 








1435 


He 


Leu 


Thr 


Asn Pro 








1450 


lie 


Ala 


Leu 


He lie 








1465 


Leu 


Ser 


Phe 


Asn Asn 








1480 


Ala 


Leu 


Tyr 


Val Gin 








1495 


Glu 


Pro 


Ala 


Gin Gly 








1510 


Val 


Thr 


Glu 


Val Asn 








1525 


Ser 


Lys 


Gly 


Tyr lie 








1540 


Leu 


Gin 


He 


Ser Ala 








1555 


Tyr 


Phe 


Val 


Cys Thr 








1570 









1110 


lie 


Leu 


Gin 


Phe His 








1125 


Val 


Thr 


Val 


Asn Trp 








1140 


Phe 


Ala 


Asn 


Phe Ser 








1155 


Asn 


Thr 


Thr 


Leu Phe 








1170 


Glu 


Lys 


Glu 


Val Tyr 








1185 


Gly Val 


Pro 


Pro Ala 








1200 


Ala 


Ala 


Val 


Leu Thr 








1215 


Leu 


Asn 


Phe 


Ala Leu 








1230 


Asn 


He 


Thr 


lie Gin 








1245 


Gly Ala 


He 


Asn Val 








1260 


Leu 


Lys Asn 


Gin Thr 








1275 


Phe 


Val 


Pro 


lie lie 








1290 


Ala 


Ala 


Ala 


He Asn 








1305 


Leu 


Glu 


Glu 


Tyr Phe 








1320 


Met 


Ala 


Ala 


Ser Thr 








1335 


Leu 


Thr 


Ala 


Gin Val 








1350 


Val 


lie 


Glu 


Trp Gin 








1365 


Gly Ser Leu 


Thr Leu 








1380 


His 


Val 


Ser 


Leu Phe 








1395 


Gly Leu Asp 


Tyr lie 








1410 


Gly Glu Arg 


Tyr Lys 








1425 


lie 


Pro 


Glu 


Gly Asp 








1440 


Ser 


Pro Gly 


Leu Glu 








1455 


Val 


Leu Ala 


Asn Asp 








1470 


Ser 


Glu 


His 


Phe Phe 








1485 


Glu 


Ser Val 


Ala Val 








1500 


Leu 


Phe Gly 


Thr Val 








1515 


Ser 


Ser 


Asn 


Glu Ser 








1530 


Val 


Leu 


Glu 


Glu Gly 








1545 


lie 


Leu Asp 


Thr Glu 








1560 


Leu 


Phe Asn 


Pro Thr 








1575 



49/83 



OOCID: <WO 024641 5A2JA> 



Gly 


Gly Ala 


Arg Leu 


Gly Val 


His 


Val Gin 


Thr 


Leu 


He 


Thr Val 




Gin 




1580 








1585 








1590 


Leu 


Asn 


Gin Ala 


Pro 


Leu 


Gly Leu Phe 


Ser 


He 


Ser 


Ala Val 








1595 








1600 








1605 


Glu 


Asn 


Arg 


Ala Thr 


Ser 


He 


Asp 


He Glu 


Glu 


Ala 


Asn 


Arg Thr 








1610 








1615 








1620 


Val 


Tyr 


Leu 


Asn Val 


Ser 


Arcr 


Thr 


Asn Gly He Asp Leu Ala Asp 








1625 








J. O J \J 








1635 


Leu 


Asn 


He 


Glu Asn 


Pro 


Lvs 


Thr 


LyS bill 


Aia 


rne 


Asn 


He Gly 








1640 








1645 








1650 


Phe 


Ser 


Pro 


Tyr Phe 


Val 


He 


Thr 


His Glu 


Glu 


Arg 


Asn 


Glu Glu 








1655 








1660 






1665 


Lys 


Pro 


Ser 


Leu Asn 


Ser 


Val 


rue 


Thr Phe 


Thr 


Ser 


Gly 


Phe Lys 








1670 








1675 








1680 


Leu 


Phe 


Leu 


Val Gin 


Thr 


He 


He 


He Leu 


Glu 


Ser 


Ser 


Gin Val 








1685 








1690 








1695 


Arg 


TVr 

* Jr ■*■ 


Phe 


Thr Ser 


Asp 


Ser 


Gin 


Asp Tyr 


Leu 


lie 


He 


Ala Ser 








1700 








1705 








1710 


Gin 


Arc? 


Ast> 


Asp Ser 


Glu 


Leu 


Thr 


Gin Val 


Phe 


Arg 


Trp 


Asn Gly 








1715 








1720 








1725 


Gly 


Ser 


Phe 


Val Leu 


His 


Gin 


Lys 


Leu Pro 


Val 


Arg 


Gly Val Leu 








1730 








1735 








1740 


Thr 


Val 


Ala 


Leu Phe 


Asn 


Lys 


Gly Gly Ser Val 


Phe 


Leu 


Ala He 








1745 








1750 








1755 


Ser 


Gin 


Ala 


Asn Ala 


Arg 


Leu 


Asn 


Ser Leu 


Leu 


Phe 


Arg 


Trp Ser 








1760 








1765 






1770 


Gly 


Ser 


Gly 


Phe He 


Asn 


Phe 


Gin 


Glu Val 


Pro 


Val 


Ser 


Gly Thr 








1775 








1780 








1785 


Thr 


Glu 


Val 


Glu Ala 


Leu 


Ser 


Ser 


Ala Asn 


Asp 


He 


Tyr 


Leu He 








1790 








1795 








1800 


Phe 


Ala 


Lvs 


Asn Val 


Phe 


Leu 


Gly Asp Gin Asn 


Ser 


He 


Asp He 








1805 








1810 








1815 


Phe 


lie 




Glu Met 


Gly Gin 


Ser 


Ser Phe Arg Tyr 


Phe 


Gin Ser 








1820 








1825 








1830 


Val 


Asp 


Phe 


Ala Ala 


Val 


Asn 


Arg 


He His 


Ser 


Phe 


Thr 


Pro Ala 








1835 








1840 








1845 


Ser 


Gly 


He 


Ala His 


He 


Leu 


Leu 


He Gly Gin 


Asp 


Met 


Ser Ala 








1850 








1855 






1860 


Leu 


Tyr 


Cvs 


Trp Asn 


Ser 


Glu 


Arg 


Asn Gin 


Phe 


Ser 


Phe 


Val Leu 








1865 








1870 








1875 


Glu 


Val 


Pro 


Ser Ala 


Tyr 


Asp Val 


Ala Ser 


Val 


Thr 


Val 


Lys Ser 








1880 








1885 








1890 


Leu 


Asn 


Ser 


Ser Lys 


Asn 


Leu 


He 


Ala Leu 


Val 


Gly Ala 


His Ser 








1895 








1900 








1905 


His 


He 


Tvr 


Glu Leu 


Ala 


Tyr 


He 


Ser Ser 


His 


Ser 


Asp 


Phe He 








1910 








1915 






1920 


Pro 


Ser 


Ser 


Gly Glu 


Leu 


He 


Phe 


Glu Pro 


Gly Glu Arg 


Glu Ala 








1925 








1930 








1935 


Thr 


He 


Ala 


Val Asn 


He 


Leu Asp 


Asp Thr 


Val 


Pro 


Glu 


Lys Glu 








1940 








1945 








1950 


Glu 


Ser 


Phe 


Lys Val 


Gin 


Leu 


Lys 


Asn Pro 


Lys 


Gly Gly Ala Glu 








1955 








1960 








1965 


He Gly He Asn Asp Ser Val 


Thr 


He Thr 


He 


Leu 


Ser 


Asn Asp 








1970 








1975 








1980 


Asp 


Ala 


Tyr 


Gly He 


Val 


Ala 


Phe 


Ala Gin 


Asn 


Ser 


Leu 


Tyr Lys 








1985 








1990 








1995 


Gin 


Val 


Glu 


Glu Met 


Glu 


Gin 


Asp 


Ser Leu 


Val 


Thr 


Leu 


Asn Val 








2000 








2005 








2010 


Glu 


Arg 


Leu 


Lys Gly Thr 


Tyr Gly Arg He 


Thr 


He 


Ala 


Trp Glu 








2015 








2020 








2025 


Ala 


Asp Gly Ser He 


Ser 


Asp 


He 


Phe Pro 


Thr 


Ser Gly Val Val 








2030 








2035 








2040 


Glu 


Lys 


Arg Met Ser 


Ala 


Lys 


He 


Leu Phe 


Thr 


Glu 


Gly Gin Val 
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2045 2050 2055 

Leu Ser Thr lie Thr Leu Thr lie Leu Ala Asp Asn lie Pro Glu 

2060 2065 2070 

Leu Ser Glu Val Val lie Val Thr Leu Thr Arg He Thr Thr Glu 

2075 2080 2085 

Gly Val Glu Asp Ser Tyr Lys Gly Ala Thr He Asp Gin Asp Arg 

2090 2095 2100 

Ser Lys Ser Val He Thr Thr Leu Pro Asn Asp Ser Pro Phe Gly 

2105 2110 2115 

Leu Val Gly Trp Arg Ala Ala Ser Val Phe He Arg Val Ala Glu 

2120 2125 2130 

Pro Lys Glu Asn Thr Thr Thr Leu Gin Leu Gin He Ala Arg Asp 

2135 2140 2145 

Lys Gly Leu Leu Gly Asp He Ala He His Leu Arg Ala Gin Pro 

2150 2155 2160 

Asn Phe Leu Leu His Val Asp Asn Gin Ala Thr Glu Asn Glu Asp 

2165 2170 2175 

Tyr Val Leu Gin Glu Thr He lie lie Met Lys Glu Asn He Lys 

2180 2185 2190 

Glu Ala His Ala Glu Val Ser lie Leu Pro Asp Asp Leu Pro Glu 

2195 2200 2205 

Leu Glu Glu Gly Phe lie Val Thr lie Thr Glu Val Asn Leu Val 

2210 2215 2220 

Asn Ser Asp Phe Ser Thr Gly Gin Pro Ser Val Arg Arg Pro Gly 

2225" 2230 2235 

Met Glu lie Ala Glu lie Met lie Glu Glu Asn Asp Asp Pro Arg 

2240 2245 2250 

Gly He Phe Met Phe His Val Thr Arg Gly Ala Gly Glu Val He 

2255 2260 2265 

Thr Ala Tyr Glu Val Pro Pro Pro Leu Asn Val Leu Gin Val Pro 

2270 2275 2280 

Val Val Arg Leu Ala Gly Ser Phe Gly Ala Val Asn Val Tyr Trp 

2285 2290 2295 

Lys Ala Ser Pro Asp Ser Ala Gly Leu Glu Asp Phe Lys Pro Ser 

2300 2305 2310 

His Gly lie Leu Glu Phe Ala Asp Lys Gin Val Thr Ala Met He 

2315 2320 2325 

Glu He Thr lie lie Asp Asp Ala Glu Phe Glu Leu Thr Glu Thr 

2330 2335 2340 

Phe Asn lie Ser Leu lie Ser Val Ala Gly Gly Gly Arg Leu Gly 

2345 2350 2355 

Asp Asp Val Val Val Thr Val Val lie Pro Gin Asn Asp Ser Pro 

2360 2365 2370 

Phe Gly Val Phe Gly Phe Glu Glu Lys Thr Val Met He Asp Glu 

2375 2380 2385 

Ser Leu Ser Ser Asp Asp Pro Asp Ser Tyr Val Thr Leu Thr Val 

2390 2395 2400 

Val Arg Ser Pro Gly Gly Lys Gly Thr Val Arg Leu Glu Trp Thr 

2405 2410 2415 

lie Asp Glu Lys Ala Lys His Asn Leu Ser Pro Leu Asn Gly Thr 

2420 2425 2430 

Leu His Phe Asp Glu Thr Glu Ser Gin Lys Thr lie Val Leu His 

2435 2440 2445 

Thr Leu Gin Asp Thr Val Leu Glu Glu Asp Arg Arg Phe Thr He 

2450 2455 2460 

Gin Leu lie Ser He Asp Glu Val Glu lie Ser Pro Val Lys Gly 

2465 2470 2475 

Ser Ala Ser lie lie lie Arg Gly Asp Lys Arg Ala Ser Gly Glu 

2480 2485 2490 

Val Gly He Ala Pro Ser Ser Arg His lie Leu lie Gly Glu Pro 

2495 2500 2505 

Ser Ala Lys Tyr Asn Gly Thr Ala lie lie Ser Leu Val Arg Gly 

2510 2515 2520 
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Pro 


Gly 


He 


Leu Gly Glu 


Val 


Thr 


Val Phe 


Trp Arg 


He 


Phe Pro 








2525 






2530 






2535 


Pro 


Ser 


Val 


Gly Glu Phe 


Ala 


Glu 


Thr Ser Gly Lys 


Leu 


Thr Met 








2540 






2545 






2550 


Arg 


Asp 


Glu 


Gin Ser Ala 


Val 


He 


Val Val 


He Gin 


Ala 


Leu Asn 








2555 






2560 






2565 


Asp 


Asp 


He 


Pro Glu Glu 


Lys 


Ser 


Phe Tyr 


Glu Phe 


Gin 


Leu Thr 








2570 






2575 






2580 


Ala 


Val 


Ser 


Glu Gly Gly Val 


Leu 


Ser Glu 


Ser Ser 


Ser 


Thr Ala 








2585 






2590 






2595 


Asn 


He 


Thr 


Val Val Ala 


Ser 


Asp 


Ser Pro 


Tyr Gly Arg 


Phe Ala 








2600 






2605 






2610 


Phe 


Ser 


His 


Glu Gin Leu 


Arg 


Val 


Ser Glu 


Ala Gin 


Arg 


Val Asn 








2615 






2620 




2625 


He 


Thr 


He 


He Arg Ser 


Ser 


Gly Asp Phe Gly His 


Val 


Arg Leu 








2630 






2635 






2640 


Trp 


Tyr 


Lys 


Thr Met Ser Gly 


Thr 


Ala Glu 


Ala Gly 


Leu 


Asp Phe 








2645 






2650 






2655 


Val 


Pro 


Ala 


Ala Gly Glu 


Leu 


Leu 


Phe Glu 


Ala Gly 


Glu 


Met Arg 








2660 






2665 






2670 


Lys 


Ser 


Leu 


His Val Glu 


He 


Leu 


Asp Asp 


Asp Tyr 


Pro 


Glu Gly 








2675 






2680 






2685 


Pro 


Glu 


Glu 


Phe Ser Leu 


Thr 


He 


Thr Lys 


Val Glu 


Leu 


Gin Gly 








2690 






2695 






2700 


Arg 
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Met 


Asp 


Thr 


Pro 


Arg Gly He 


Gly Thr 


Phe 


Val 


Val 


Trp 


Asp 


Tyr 


1 








5 




10 










15 


Val 


Val 


Phe 


Ala 


Gly Met Leu 


Val He 


Ser 


Ala 


Ala 


He 


Gly 


He 










20 




25 










30 


Tyr 


Tyr 


Ala 


Phe 


Ala Gly Gly Gly Gin 


Gin 


Thr 


Ser 


Lys 


Asp 


Phe 










35 




40 










45 


Leu 


Met 


Gly 


Gly Arg Arg Met 


Thr Ala 


Val 


Pro 


Val 


Ala 


Leu 


Ser 










50 




55 










60 


Leu 


Thr 


Ala 


Ser 


Phe Met Ser 


Ala Val 


Thr 


Val 


Leu 


Gly 


Thr 


Pro 










65 




70 










75 


Ser 


Glu 


Val 


Tyr 


Arg Phe Gly 


Ala He 


Phe 


Ser 


He 


Phe 


Ala 


Phe 










80 




85 










90 


Thr 


Tyr 


Phe 


Phe 


Val Val Val 


He Ser 


Ala 


Glu 


Val 


Phe 


Leu 


Pro 










95 




100 










105 


Val 


Phe 


Tyr 


Lys 


Leu Gly He 


Thr Ser 


Thr 


Tyr 


Glu 


Tyr 


Leu 


Glu 










110 




115 










120 


Leu 


Arg 


Phe 


Asn 


Lys Cys Val 


Arg Leu 


Cys 


Gly Thr Val 


Leu 


Phe 










125 




130 










135 


He 


Val 


Gin 


Thr 


He Leu Tyr 


Thr Gly 


He 


Val 


He 


Tyr 


Ala 


Pro 










140 




145 










150 


Ala 


Leu 


Ala 


Leu 


Asn Gin Val 


Thr Gly 


Phe 


Asp 


Leu 


Trp 


Gly 


Ala 










155 




160 










165 


Val 


Val 


Ala 


Thr 


Gly Val Val 


Cys Thr 


Phe 


Tyr 


Cys 


Thr 


Leu 


Gly 










170 




175 










180 


Gly 


Leu 


Lys 


Ala 


Val He Trp 


Thr Asp 


Val 


Phe 


Gin 


Val 


Gly 


He 










185 




190 








195 
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Met 


Val 


Ala Gly 


Phe 


Ala Ser 










200 




Gin 


Gly 


Gly 


He 


Ser 


Thr lie. 










215 




Arg 


Leu 


Asn 


Phe 


Trp 


Asn Phe 










230 




Thr 


Phe 


Trp Thr 


He 


lie lie 










245 




He 


Tyr 


Gly Val 


Asn 


Gin Ser 










260 




Lys 


Ser 


Arg 


Phe 


Gin 


Ala Lys 










275 




Gly 


Leu 


Trp Ala 


He 


Leu Thr 










290 




Leu 


Tyr 


Ser 


Arg 


Tyr 


His Asp 










305 




Val 


Ser 


Ala 


Pro 


Asp 


Gin Leu 










320 




Leu 


Gin 


Asp 


Tyr 


Pro 


Gly Leu 










335 




Tyr 


Ser 


Gly Thr 


Leu 


Ser Thr 










350 




Ala 


Ala 


Val 


Thr 


Val 


Glu Asp 










365 




Leu 


Ser 


Glu 


Arg 


Ser 


Leu Ser 










380 




Val 


Tyr 


Gly Ala 


Leu 


Cys He 










395 




Met 


Gly 


Ala 


Leu 


Leu 


Gin Ala 










410 




Gly 


Gly 


Pro 


Leu 


Met 


Gly Leu 










425 




Phe 


Ala 


Asn 


Ser 


He 


Gly Ala 










440 




Ala 


He 


Ser 


Leu 


Trp 


Val Gly 










455 




Leu 


Pro 


Glu 


Arg 


Thr 


Leu Pro 










470 




Asn 


Ser 


Thr 


Tyr 


Asn 


Glu Thr 










485 




Pro 


Phe 


Thr 


Thr 


Ser 


Val Phe 










500 




Pro 


Leu 


Met 


Asp 


Asn 


Trp Tyr 










515 




Thr 


Val 


Gly Thr 


Leu 


Val Thr 










530 




Leu 


Ser 


Thr Gly 


Gly 


Arg Lys 










545 




Leu 


Thr 


Lys 


Glu 


Asp 


Phe Leu 










560 




Lys 


Lys 


His 


Val 


Leu 


Ser Tyr 










575 




Gly 


Thr 


Asp Asn 


Pro 


Ala Phe 










590 




Gin 


Ser 


Gly Lys 


Ser 


Asn Gly 



605 

<210> 32 
<211> 552 
<212> PRT 

<213> Homo sapiens 
<220> 



Val 


He He 


Gin Ala Val 


Val 


Met 




205 






210 


Leu 


Asn Asp 


Ala Tyr Asp 


Gly Gly 




220 






225 


Asn 


Pro Asn 


Pro Leu Gin 


Arg 


His 




235 






240 


Gly Gly Thr 


Phe Thr Trp 


Thr 


Ser 




250 






255 


Gin Val Gin 


Arg Tyr lie 


Ser Cys 




265 






270 


Leu 


Ser Leu 


Tyr lie Asn 


Leu 


Val 




280 






A O J 


Cys 


Ser Val 


Phe Cys Gly 


Leu 


Ala 




295 






ion 


Cys 


Asp Pro 


Trp Thr Ala 


Lys 






310 






J x ~t 


Met 


Pro Tyr 


Leu Val Leu 


Asp 


He 




325 






_> _> u 


Pro 


Gly Leu 


Phe Val Ala 


Cys 


Ala 




340 






345 


Val 


Ser Ser 


Ser lie Asn 


Ala 


Leu 




355 






360 


Leu 


He Lys 


Pro Tyr Phe 


Arg 


Ser 




370 






375 


Trp 


He Ser 


Gin Gly Met 


Ser 


Val 




385 






390 


Gly Met Ala 


Ala Leu Ala 


Ser 


Leu 




400 






405 


Ala 


Leu Ser 


Val Phe Gly 


Met 


Val 




415 






420 


Phe 


Ala Leu 


Gly lie Leu 


Val 


Pro 




430 






435 


Leu 


Val Gly 


Leu Met Ala 


Gly 


Phe 




445 






450 


He Gly Ala 


Gin lie Tyr 


Pro 


Pro 




460 






465 


Leu 


His Leu 


Asp lie Gin 


Gly Cys 




475 






480 


Asn 


Leu Met 


Thr Thr Thr 


Glu 


Met 




490 






495 


Gin 


lie Tyr 


Asn Val Gin 


Arg 


Thr 




505 






510 


Ser 


Leu Ser 


Tyr Leu Tyr 


Phe 


Ser 




520 






525 


Leu 


Leu Val 


Gly lie Leu 


Val 


Ser 




535 






540 


Gin 


Asn Leu 


Asp Pro Arg 


Tyr 


He 




550 






555 


Ser 


Asn Phe 


Asp lie Phe 


Lys 


Lys 




565 






570 


Lys 


Ser His 


Pro Val Glu 


Asp 


Gly 




580 






585 


Asn 


His lie 


Glu Leu Asn 


Ser 


Asp 




595 






600 


Thr 


Arg Leu 










610 
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<221> misc_f eature 
<223> Incyte ID No: 



7487393CD1 



<400> 32 



Met 


Gly 


Phe 


Asn 


Val 


Leu 


Leu 


Asp 


Gin 


Val 


Glv 


Glv 


Met 


Glv 




1 








5 










1 0 










X — > 


Phe 


Gin 


He 


Cys 


Leu 
20 


He 


Ala 


Phe 


Phe 


Cys 

9^ 


He 


Thr 


Asn 


He 


Leu 
o 


Leu 


Phe 


Pro 


Asn 


He 
35 


Val 


Leu 


Glu 


Asn 


Phe 

A 0 


Thr 


Ala 


Phe 


Thr 


Pro 

A t% 


Ser 


His 


Arg 


Cys 


Trp 
•j \j 


Val 


Pro 


Leu 


Leu 


Asp 

D D 


Asn 


Asp 


Thr 


Val 


Ser 
o u 


Asp 


Asn 


Asp 


Thr 


Gly 


Thr 


Leu 


Ser 


Lys 


Asp 

9 f\ 


Asp 


Leu 


Leu 


Arg 


He 

/ D 


Ser 


He 


Pro 


Leu 


Asp 


Ser 


Asn 


Leu 


Arg 


Pro 


Gin 


Lys 


Cys 


Gin 


Arg 










ho 


















q n 


Phe 


He 


His 


Pro 


Gin 


Trp 


Gin 


Leu 


Leu 


His 
i on 


Leu 


Asn 


Gly 


Thr 


Phe 


Pro 


Asn 


Thr 


Asn 


Clu 
lift 


Pro 


Asp 


Thr 


Glu 


Pro 

± ±D 


Cys 


Val 


Asp 


Gly 


Trp 
ion 


Val 


Tyr 


Asp 


Arg 


Set 


Ser 


Phe 


Leu 


Ser 


Thr 

1 


He 


Val 


Thr 


Glu 


Trp 

1 J j 


Asp 


Leu 


Val 


Cys 


Glu 


Ser 


Gin 


Ser 


Leu 


Lys 

14 J 


Ser 


Met 


Val 


Gin 


Ser 

i cn 


Leu 


Phe 


Met 


Ala 


Gly 

A J J 


Ser 


Leu 


Leu 


Gly 


Gly 

1 fiO 


Leu 


He 


Tyr 


Gly 


His 

ID J 


Leu 


Ser 


Asp 


Arg 


170 


Gly 


Arg 


Lys 


He 


He 

X / D 


Cys 


Lys 


Leu 


Cys 


Phe 

1 QO 

XoU 


Leu 


Gin 


Leu 


Ala 


He 

1 es 

1 O J 


Ser 


Asn 


Thr 


Cys 


Ala 

1 QO 
X u 


Ala 


Phe 


Ala 


Pro 


Thr 

1 QC 


Phe 


Leu 


Val 


Tyr 


cys 

500 


He 


Leu 


Arg 


Phe 


Leu 


Ala 


Gly 


Phe 


Ser 


Thr 

Z JL U 


Met 


Thr 


He 


Leu 


Gly 

* x -> 


Asn 


Thr 


Phe 


He 


Leu 

9 90 


Ser 


Leu 


Glu 


Trp 


Thr 

one 
ZZD 


Leu 


Pro 


Arg 


Ser 


Arg 

? **o 


Ser 


Met 


Thr 


He 


Met 
9 1 e; 


Val 


Leu 


Leu 


Cys 


Ser 


Tyr 


Ser 


Val 


Gly 


Gin 
^ *• j 


Met 


Leu 


Leu 


Gly 


Gly 


Leu 


Ala 


Phe 


Ala 


He 

occ 

ZJJ 


Gin 


Asp 


Trp 


His 


He 

260 
^ u w 


Leu 


Gin 


Leu 


Thr 


Val 

ZO J 


Ser 


Thr 


Pro 


He 


He 

9"70 
Z / U 


Val 


Leu 


Phe 


Leu 


Ser 


Ser 


Trp 


Lys 


Met 


Val 
9 ftn 


Glu 


Ser 


Ala 


Arg 


Trp 


Leu 


He 


He 


Asn 


Asn 

290 


Gin 


Leu 


Asp 


Glu 


Gly 

Z7j 


Leu 


Lys 


Glu 


Leu 


Arg 

jUU 


Arg 


Val 


Ala 


His 


He 
305 


Asn 


oxy 


Lys 


T A/C 


noil 

iin 

-> X \-/ 


Thr 


VjIU 




1 l IX 


Leu 

J X _> 


Thr 


Thr 


Glu 


Leu 


Val 
320 


Arg 


Ser 


Thr 


Met 


Lys 
**95 


Lys 


Glu 


Leu 


Asp 


Ala 

"*"*0 

J JU 


Val 


Arg 


He 


Lys 


Thr 
335 


Ser 


He 




Cor 


T .01 i 




Arg 


Ala 


rXO 


T \7G 

-L» jr 2d 
.5 H J 


Leu 


Arg 


Met 


Arg 


Val 
350 


Phe 


Glv 


Leu 


X S3 


-j -} -j 


val 




rile 


Ala 


Tip 
lie 

J o VJ 


Thr 


Val 


Pro 


Phe 


Tyr 
365 


Gly 


Leu 


He 


Leu 


Asn 
370 


Leu 


Gin 


His 


Leu 


Glv 
375 


Ser 


Asn 


Val 


Ser 


Leu 


Phe 


Gin 


He 


Leu 


Cys 


Gly 


Ala 


Val 


Thr 


Phe 










380 










385 








390 


Thr 


Ala 


Arg 


Cys 


Val 
395 


Ser 


Leu 


Leu 


Thr 


Leu 
400 


Asn 


His 


Met 


Gly 


Arg 
405 


Arg 


He 


Ser 


Gin 


He 


Leu 


Phe 


Thr 


Phe 


Pro 


Val 


Gly 


Leu 


Phe 


He 










410 










415 








420 


Leu 


Val 


Asn 


Thr 


Phe 


Leu 


Pro 


Gin 


Glu 


Met 


Gin 


He 


Leu 


Arg 


Val 










425 










430 








435 


Val 


Leu 


Ala 


Thr 


Leu 


Gly 


He 


Gly 


Ser 


Val 


Ser 


Ala 


Ala 


Ser 


Asn 
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440 445 450 



Ser 


Ala 


Ser 


Val 


His 


His 


Asn 


Glu 


Leu 


Val 


Pro Thr 


He 


Leu 


Arg 










455 










460 








465 


Ser 


Thr 


Val 


Ala 


Gly 


He 


Asn 


Ala 


Val 


Ser 


Gly Arg 


Thr 


Gly 


Ala 










470 










475 








480 


Ala 


Leu 


Ala 


Pro 


Leu 


Leu 


Met 


Thr 


Leu 


Met 


Ala Tyr 


Ser 


Pro 


His 










485 










490 








495 


Leu 


Pro 


Trp 


He 


Ser 


Tyr 


Gly 


Val 


Phe 


Pro 


He Leu 


Ala 


Val 


Pro 










500 










505 








510 


Val 


He 


Leu 


Leu 


Leu 


Pro 


Glu 


Thr 


Arg 


Asp 


Leu Pro 


Leu 


Pro 


Asn 










515 










520 








525 


Thr 


He 


Gin 


Asp 


Val 


Glu 


Asn 


Asp Arg 


Lys 


Asp Ser 


Arg 


Asn 


He 










530 










535 








540 


Lys 


Gin Glu Asp 


Thr Cys 


Met 


Lys 


Val 


Thr 


Gin Phe 









545 550 

<210> 33 
<211> 2365 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7484831CB1 

<400> 33 

atggagcctg gtgacgcggc gcgccctggc tcgggtcggg ctaccggggc gccgccgccg 60 
cggctgctgc tgctgccgct gctgctgggt tgggggctgc gagtcgcggc cgcggcctcg 120 
gcctcctcct ctggggcggc ggccgaggac agcagcgcca tggaggagct cgctactgag 180 
aaggaggcgg aggagagcca ccggcaagac agcgtgagcc tgctcacctt catcctgctg 240 
ctcacgctca ccatcctcac catctggctc ttcaagcacc gccgggtgcg ctttctgcac 300 
gagaccgggc tggccatgat ctatgggctc atcgttgggg tgatcctgag gtatggtacc 360 
cctgctacca gtggccgtga caaatcactc agctgcactc aggaagacag ggccttcagt 420 
accttattag tgaatgtcag cggaaagttc ttcgaataca ctctgaaagg agaaatcagt 480 
cctggcaaga tcaacagcgt agagcagaat gatatgctac ggaaggtaac attcgatcca 540 
gaagtatttt tcaacattct tctgcctcca attatttttc atgctggata cagcttaaag 600 
aagagacact ttttcagaaa tcttggatct atactggcct atgccttctt ggggactgct 660 
gtttcatgct tcattattgg aaatctcatg tatggtgtgg tgaagctcat gaagattatg 720 
ggacagctct cagataaatt ttactacaca gattgtctct tttttggagc aatcatctct 780 
gccactgacc cagtgactgt gctggcgata tttaatgaat tgcatgcaga cgtggatctt 840 
tacgcacttc tttttggaga gagcgtccta aatgatgctg ttgccattgt actgtcctcg 900 
tctattgttg cctaccagcc agcgggactg aacactcacg cctttgatgc tgctgccttt 960 
tttaagtcag ttggcatttt tctaggtata tttagtggct cttttaccat gggagctgtg 1020 
actggtgttg tgactgctct agtgactaag tttaccaaac tgcactgctt ccccctgctg 1080 
gagacggcgc tgttcttcct catgtcctgg agcacgtttc tcttggcaga agcctgcgga 1140 
tttacaggtg ttgtagctgt ccttttctgt ggaatcacac aagctcatta cacctacaac 1200 
aatctgtcgg tggaatcaag aagtcgaacc aagcagctct ttgaggtgtt acatttcctg 1260 
gcagagaact tcatcttctc ctacatgggc ctggcactgt ttaccttcca gaagcacgtt 1320 
ttcagcccca ttttcatcat cggagctttt gttgccatct tcctgggcag agccgcgcac 1380 
atctacccgc tctccttctt cctcaacttg ggcagaaggc ataagattgg ctggaatttt 1440 
caacacatga tgatgttttc aggcctcagg ggagcaatgg catttgcgtt ggccatccgt 1500 
gacacggcat cctatgctcg ccagatgatg ttcacgacca cccttctcat tgtgttcttc 1560 
actgtctgga tcattggagg aggcacgaca cccatgttgt catggcttaa catcagagtt 1620 
ggcgtcgagg agccctccga agaggaccag aatgaacacc actggcagta cttcagagtt 1680 
ggtgttgacc ccgatcaaga cccaccaccc aacaacgaca gctttcaagt cttacaaggg 1740 
gacggcccag attctgccag aggaaaccgg acaaaacagg agagcacatg gatattcagg 1800 
ctgtggtaca gctttgatca caattatctg aagcccatcc tcacacacag tggtccccca 1860 
ctaaccacca cgctccccgc ctggtgtggc ttactagctc gatgtctgac cagtccccag 1920 
gtgtacgata accaagagcc actgagagag gaagactctg atttcatcct gaccgaaggc 1980 
gacctgacat tgacctacgg ggacagcaca gtgactgcaa atggctcctc aagttcgcac 2040 
accgcctcca cgagtctgga gggcagccgg agaacgaaga gcagctcgga ggaagtgctg 2100 
gagcgagacc tgggaatggg agaccagaag gtttcgagcc ggggcacccg cctagtgttt 2160 
cccctggaag ataatgcttg actttccccc caagccctgg cgcgatggga ctgaagattg 2220 
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aaaatacatc cacgaacatt tcaacatgga acgaagaatt atagttcctt ctctggctat 
atccataaag aagtagttat gaaatgttta aaaccaaagg caaatagtgt tatactctta 
tttttgatta atctgaggaa aggag 

<210> 34 
<211> 3400 
<212> DNA 

<213> Homo sapiens 



2280 
2340 
2365 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2477266CB1 



<400> 34 

agcggcggct 

ctgctaccgg 

gcccccgccg 

cgccagtttg 

gcggagcaac 

cactgtgtga 

gaggcctggc 

gagagagaga 

caaccccacc 

cctgggctgc 

gaagatgtgc 

atccccatag 

tcagaggtgc 

gctagaaatg 

ttggtcaaag 

acccttttcc 

gattcagaag 

gtacaagttg 

atgcccatgt 

ctggtgctgg 

ccactgctgc 

gcgctgtttg 

agggaggtaa 

cacctagcca 

cagccctacc 

ccccgggcca 

tcgctgctgc 

cgtgaggacc 

gcagtggggg 

tgcgaccagg 

tcgggtctga 

ctgtcactgc 

ctgtttgacg 

gaagaggatg 

gaagacacct 

tacatggaaa 

cggaaggcag 

agctgcccct 

tcctacatgc 

gccctgacag 

gctgagctct 

gacgaggagg 

cacgctggag 

ttctttgccg 

gagaagtcct 

gcccagtttg 

gaggtgcgaa 

gcccaggaac 

catgatcgtg 



cagttgctgc 
acaccgagcg 
ctttgccggc 
cggccgtgct 
gggagagcct 
gcctcagcct 
cacagctttt 
tggggctttt 
accgggagct 
tcttctactc 
ctctcgctcg 
atgaggcaaa 
cggtcatcac 
tggccctggg 
tcaagagcaa 
ccattgtggc 
aggaagagtt 
tggacatgct 
tggaagaggc 
ccgtgctgtc 
agattgtgtg 
ccctgggcca 
tgccactgct 
aggcctgcta 
ttccggagct 
aggagctggc 
cctacttccc 
ttcagcctgt 
agcccatgag 
tagacgacgc 
tgggtgaggg 
gttccaccga 
atgagagtga 
actcagagat 
gtgctgccgt 
gtgtctttga 
cccatgaggc 
cggaacccaa 
aggcagtgaa 
gggtgctccg 
gtggcgtgct 
aggaagagga 
aggccatccc 
gtttcctgcc 
ttgcagtggg 
tgtctcggct 
gcaatgccat 
acttccccaa 
tccgtgacaa 



catggagtca 
catccgtcgg 
tctctgcgac 
gacccgcaga 
caagtccctg 
ggcccagctc 
gcagctgctt 
gctgctaagt 
tcttcggctt 
cctgcgcact 
gatgttggtg 
ggcctgtgag 
cccctacctc 
caatgcgata 
ggccttactg 
tgctgagccc 
ggagattgag 
ggcactacac 
tttgcggagc 
tgacggagct 
caagggcctg 
gttctcagaa 
cctcgcctac 
tgccctggag 
tatggaatgc 
tgtgagcgcc 
tgccatcatg 
gcagatccag 
gccgctggct 
tgacttgcgg 
cctggcgccc 
gggcattgtg 
tggggaagaa 
ctcagggtac 

gggggagatc 

agaagtattt 
tctgggtcag 
cactgctgct 
cagggagcgg 
cagctgtggg 
caaggctgtg 
agatgatgat 
tgccctggca 
attattggtg 
gaccttggca 
gctccctgtg 
cttcgggatg 
gctgctgggg 
catctgtggg 



gccgggctag 
gccacggaac 
ctgctagcct 
cgactgaaca 
atcctgacgg 
tcagccacca 
cagcacagta 
gtggtggtga 
ctgaatgaga 
ctgaccacca 
cccaagctga 
gcccttgagg 
tctgaagtcc 
cgcatacgta 
aagaatcgtc 
ccaccaggcc 
ctgatggggg 
ctgccccccg 
gagagcccat 
ggcgaccaca 
gaggacccct 
aacctacagc 
ttgaagtcgg 
aattttgtgg 
atgctgcagc 
ctgggagcca 
gagcacctgc 
agcctggaga 
gaggaatgct 
cgctgcacgt 
cacttggaac 
cctcagtatg 
gaggaggagc 
agcgtggaga 
tctgtgaaca 
aaactgctgg 
ttttgctgtg 
ttgcaggctg 
gaacgccagg 
accctcacac 
ctgcagagga 
caggctgaat 
gccgcggctg 
tgcaagacaa 
gagactattc 
ctgttgagca 
ggcgtgctgg 
ctcctttttc 
gcacttgccc 



agcagctcct 
agctccagat 
cggcggccga 
cccgctggcg 
ccctgcagag 
tttttcgaaa 
cccacagccc 
cctcccggcc 
ctcttggtga 
tggctcccta 
tcatggccat 
ctttggatga 
tcacattctg 
ttctctgctg 
tcctgccacc 
agttggatcc 
agactcccaa 
agaagctctg 
accagcgcaa 
tcaggcagag 
cgcaagttgt 
cccatatcag 
tgcctcttgg 
agaacctagg 
ttctgaggaa 
ttgctacggc 
gggaattcct 
cactgggggt 
gccagctggg 
acagcctatt 
agatcaccac 
acgggagcag 
tcatggatga 
atgccttctt 
ccagtgtggc 
agtgccctca 
cactgcacaa 
ccctggcccg 
tggtgatggc 
tgaagccccc 
agacagcctg 
acgacgccat 
ggggagactc 
aacagggctg 
agggcctggg 
ccgcccaaga 
cagagcatgg 
ccctcctggc 
gcctgttgat 



acgggagctg 
cgttcttcgg 
cccccagatc 
acggctggcg 
agaaacagag 
ggaaggcttg 
ccacagccca 
cgaggccttc 
ggtgggctct 
cctcagcact 
gcagactctg 
actgttggag 
cctggaggta 
cctcactttc 
cttgctgcac 
cgaggaccag 
gcatttcgct 
tccccagctg 
agctggactc 
actgctgccc 
acgcaatgct 
cagctattca 
acacacacac 
gcccaaggtg 
ccccagcagt 
tgcccaggcc 
gttaacaggc 
gctggcacga 
tctgggcctc 
tgcagcctta 
gctcatgctg 
ctccttcctt 
ggatgtggaa 
cgatgagaag 
cttccttcca 
cctgaatgtg 
ggcctgtcaa 
agtcgtgcca 
cgtgctggag 
tgggcgcctc 
tcaggatact 
gttgctggag 
ctttgcccca 
cacagtggca 
tgctgcctca 
ggcagacccc 
gggccaccct 
gcgggagcga 
ggccagtccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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-* accaggaaac cagagcccca ggtgctggct gccctactgc atgccctgcc actgaaggag 3000 

gacttggagg agtgggtcac cattgggcgc ctcttcagct tcctgtacca gagcagccct 3060 

gaccaggtta tagatgtggc tcccgagctt ctgcgtatct gcagcctcat tctggctgac 3120 

aacaagatcc caccagacac caaggccgca ctgttgctgc tcctgacgtt cctggccaaa 3180 

cagcacaccg acagctttca agcagctctg ggctcactgc ctgttgacaa ggctcaggag 3240 

ctccaggctg tactgggcct ctcctagact gcaggctgca gccagtccag agagaataga 3300 

gcctgcccag gccttaagac cacctctcag cccagttcag ttctgcctta ccaaagattc 3360 
tgagactcat acccatttgg agccagcccc acttgctgcc 3400 

<210> 35 

<211> 4458 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 3552033CB1 

<400> 35 

gttgttgcta gtatgatata gatcatgtgt^ acgtctatgt tagtgctatg tacgttgtta 60 

tagtgtcttg cttgttgtat ctatattgtt' attgtttcgt tttgtgtgtg tat a tat tat 120 

tatcgtttgc cccgtttaat taccctccat ggggccccga gttttatttt ccccccttgg 180 

tgggatgcgt cttcaatcac cgcttttata ccctgatatt cggttcaagg gactttctcc 240 

gggggctctt tcggcaccac cgcaagaacc ggtctcagct caaatcccgg aatgaaggcc 300 

tttaaaaaga agatgtgccc aacagtgaaa ccccgccccc aggccccata ggtttccttg 360 

ggcttctgat aggcccagca ttggccctca cccccagtat ctccggcttc agaattgaga 420 

aaaatgaaag tcgcctctcc cgaaaggaca tccagtctga aaagtggtcc atcagcaaac 480 

acactcaact cagccctacg gatgcttttg ggaccattga gttccaagga ggtggccatt 540 

ccaacaaagc catgtatg.tg cgagtatctt ttgatacaaa acctgatctc ctcttacacc 600 

tgatgaccaa ggaatggcag ttggagcttc ccaagcttct catctctgtc catgggggcc 660 

tgcagaactt tgaactccag ccaaaactca agcaagtctt tgggaaaggg ctcatcaaag 720 

cagctatgac aactggagcg tggatattca ctggaggggt taacacaggt gttattcgtc 780 

atgttggcga tgccttgaag gatcatgcct ctaagtctcg aggaaagata tgcaccatag 840 

gtattgcccc ctggggaatt gtggaaaacc aggaggacct cattggaaga gatgttgtcc 900 

ggccatacca gaccatgtcc aatcccatga gcaagctcac tgttctcaac agcatgcatt 960 

cccacttcat tctggctgac aacgggacca ctggaaaata tggagcagag gtgaaacttc 1020 

gaagacaact ggaaaagcat atttcactcc agaagataaa cacaagaatc ggtcaaggtg 1080 

ttcctgtggt ggcactcata gtggaaggag gacccaatgt gatctcgatt gttttggagt 1140 

accttcgaga cacccctccc gtgccagtgg ttgtctgtga tgggagtgga cgggcatcgg 1200 

acatcctggc ctttgggcat aaatactcag aagaaggcgg actgataaat gaatctttga 1260 

gggaccagct gttggtgact atacagaaga ctttcacata cactcgaacc caagctcagc 1320 

atctgttcat catcctcatg gagtgcatga agaagaagga attgattacg gtatttcgga 1380 

tgggatcaga aggacaccag gacattgatt tggctatcct gacagcttta ctcaaaggag 1440 

ccaatgcctc ggccccagac caactgagct tagctttagc ctggaacaga gtcgacatcg 1500 

ctcgcagcca gatctttatt tacgggcaac agtggccggt gggatctctg gagcaagcca 1560 

tgttggatgc cttagttctg gacagagtgg attttgtgaa attactcata gagaatggag 1620 

taagcatgca ccgttttctc accatctcca gactagagga attgtacaat acgagacatg 1680 

ggccctcaaa tacattgtac cacttggtca gggatgtcaa aaaggggaac ctgcccccag 1740 

actacagaat cagcctgatt gacatcggcc tggtgatcga gtacctgatg ggcggggctt 1800 

atcgctgcaa ctacacgcgc aagcgcttcc ggaccctcta ccacaacctc ttcggcccca 1860 

agaggcccaa agccttgaaa ctgctgggaa tggaggatga tattcccttg aggcgaggaa 1920 

gaaagacaac caagaaacgt gaagaagagg tggacattga cttggatgat cctgagatca 1980 

accacttccc cttccctttc catgagctca tggtgtgggc tgttctcatg aagcggcaga 2040 

agatggccct gttcttctgg cagcacggtg aggaggccat ggccaaggcc ctggtggcct 2100 

gcaagctctg caaagccatg gctcatgagg cctctgagaa cgacatggtt gacgacattt 2160 

cccaggagct gaatcacaat tccagagact ttggccagct ggctgtggag ctcctggacc 2220 

agtcctacaa gcaggacgaa cagctggcca tgaaactgct gacgtatgag ctgaagaact 2280 

ggagcaacgc cacgtgcctg cagcttgccg tggctgccaa acaccgcgac ttcatcgcgc 2340 

acacgtgcag ccagatgctg ctcaccgaca tgtggatggg ccggctccgc atgcgcaaga 2400 

actcaggcct caaggtaatt ctgggaattc tacttcctcc ttcaattctc agcttggagt 2460 

tcaagaacaa agacgacatg ccctatatgt ctcaggccca ggaaatccac ctccaagaga 2520 

aggaggcaga agaaccagag aagcccacaa aggaaaaaga ggaagaggac atggagctca 2580 

tagcaatgtt gggacgaaac aacggggagt cctccaggaa gaaggatgaa gaggaagttc 2640 
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agagcgagca ccggttaatc cccctcggca 
tcgtgaagtt ctggttctac acactggcgt 
tcgtgttagt gaagatggaa cgctggccgc 
ttttcaccct gggaatagaa aagatgagag 
tacagaaagt gaaggtatgg ctgcaggagc 
ttctgttttc tgtcggaatg atccttcgtc 
gggtcatcta ctgcgtgaac atcatttact 
tgaacaagta tttgggcccg tatgtaatga 
actttgtcat cattatgctg gtggttctga 
tttttcccaa tgaggagcca tcatggaaac 
ggatgattta tggggaagtg tttgcggacc 
cccgagagga tggtaaaata atccagctgc 
cggccatcat ggcctgctac ctcttagtgg 
ctgtctctaa caatacattt tttgaagtaa 
agaggtatca gctcatcatg actttccatg 
tcttcagcca catgaccatg atattccagc 
gcgacccgga tgaaagggac tacggcctga 
aagtacatga ctttgaagag caatgcatag 
tcaactcatc taatgatgag aggatacggg 
tgcggctgga ggaagtcaac gagagagagc 
acatccggct ggcgcagctg gaagacctta 
tgacaggtct ggagcgggcc gagtccaaca 
cggacgccgc ctacattgtc cgtcagagca 
agctccaaga gagtatagac cctgcagagc 
agcagttcta cctaacgtcc cacatccttc 
taaagaaaaa acaaggtaac agtcatagtt 
ccacaactgt ttatcctttt ttgaagaaaa 
aatcgaaatg aaagaagagt tgatgttaaa 
agtgggcgtt caagtctagt cgagcaagct 
ttttaattta ttttgtatct cacctgtcat 
tgaaccttaa aaaaaaaa 

<210> 36 
<211> 2722 
<212> DNA 

<213> Homo sapiens 



gaaaaatcta tgaattctac aatgcaccca 2700 
atatcggata " cctgatgctc ttcaactata 2760 
ccacccagga atggatcgta atctcctata 2820 
agattctgat gtcagagcca gggaagttgc 2880 
actggaatgt cacggacctc atcgccatcc 2940 
tccaagacca gcccttcagg agtgacggga 3000 
ggtatatccg tctcctagac atcttcggcg 3060 
tgattggaaa aatgatgata gacatgatgt 3120 
tgagctttgg ggtcgccagg caagccatcc 3180 
tggccaagaa catcttctac atgccctatt 3240 
agatagaccc tccctgtgga cagaatgaga 3300 
ctccctgcaa gacaggagct tggatcgtgc 3360 
caaacatctt gctggtcaac ctcctcattg 3420 
aatcgatatc caaccaagtc tggaagtttc 3480 
aaaggccagt tctgccccca ccactgatca 3540 
acctgtgctg ccgatggagg aaacacgaga 3600 
aactcttcat aaccgatgat gagctcaaga 3660 
aagaatactt cagagaaaag gatgatcggt 3720 
tgacttcaga aagggtggag aacatgtcta 3780 
actccatgaa ggcttcactc cagaccgtgg 3840 
tcgggcgcat ggccacggcc ctggagcgcc 3 900 
aaatccgctc gaggacctcg tcagactgca 3960 
gcttcaacag ccaggaaggg aacaccttca 4020 
atcctcttta ttcagtataa gccggcagca 4080 
tcatgccaac acttctgtaa ttgatcatta 4140 
cacctgtctc ttatctattc acttctggtg 4200 
taagggaaca gaaatgcctt tttgtattgc 4260 
aaaacaaaag tcaagtgatt tattatatac 4320 
caggagaatg taattaaata attttatatt 4380 
ggatgaattc attcactgaa tatgtaatat 4440 

4458 



<220> 

<221> misc_feature 

<223> Incyte ID No: 4778139CB1 

<220> 

<221> unsure 
<222> 2609 

<223> a, t, c, g, or other 
<400> 36 

atggagatcg ctggagagaa gaaataccag 
agcctggctg gaggccggca tcccccaaga 
aacagtaaaa gagaggctac aaatgtaaga 
agtatagtgg tttctggggt gtctcctaac 
cttctggtcc cctctaatct aaagcgagtt 
tccagaccct ctgtgccaga tgtgctggca 
gttcatctga ctgggcagac cccagttgtc 
gaaaagcacg agcctattgc tcacttactg 
cccactttga gttgcacacc tgaacccatg 
atcatgcaaa ctggaggtgg atatgtaaac 
gtgtttagca tcaagaaaaa gtggcaaacc 
tctcggagtg aacagcacca gaagcaggcc 
agatcggccc agagtagtga atgctcagag 
ttttggatca atcctctccg cagatacagg 
cataaggagg agaaattgat gaagatctgt 
atagggcagg gagcatgggt caaagcgaaa 



tattatagca tgccctcttc tttgctgagg 60 
gtgcagaggg gggcacttct tgagattaca 120 
aatgaccagg aaaggcaaga aacacaaagt 180 
aggcaagcac actcaaagta tggccagttt 240 
cctttttcag cagaaactgt cttgcctctg 300 
actgaacagg acatccagac tgaggttctt 3 60 
tctgactggg catcagtgga tgaacccaag 420 
gatggacaag acaaggcaga gcaagtgcta 480 
acaatgagct cccctctttc ccaagccaag 540 
tgggcatttt cagaaggtga tgaaactggt 600 
tgcttgccct ccacttgtga cagtgattcc 660 
caggacagct ccctatctga taactcaaca 720 
gtgggaccat ggcttcagcc aaacacatcc 780 
cccttcgcta ggagtcatag ttttagattc 840 
aagattaaaa atctttcagg ctcttcagaa 900 
atgctaacca aagacaggag actgtcaaag 960 
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aaaaagaaga atactcaagg actccaggtg ccaatcataa cagtcaatgc ctgctctcag 1020 
agtgaccagt tgaatccaga gccaggagaa aacagcatct ctgaagagga gtacagcaag 1080 
aactggttca cagtgtccaa atttagtcac acaggtgtag aaccttacat acatcagaaa 1140 
atgaaaacta aagaaattgg acaatgtgct atacaaatca gtgattacct aaagcagtct 1200 
caagagtctg ctcaggatct cagcaaaaac tctttgtgga attccaggag caccaacctc 1260 
aataggaact ccctgtcctc tctaatttca gagatctcag cctccttaaa aagccctcaa 1320 
gagcctcacc atcattattc accttcactt ctctttgcag caggagaaga aataactgtc 1380 
tacaggttgg aggagagttc ccctttaaac cttgataaaa gcatgtcctc ttggtctcag 1440 
cgtgggagag cggcaatgat ccaggtattg tcccgagagg agatggatgg gggcctccgt 1500 
aaagctatga gagtcgtcag cacttggtct gaggatgaca ttctcaagcc gggacaagtt 1560 
ttcattgtca agtcctttct tcctgaggtt gtgcggacat ggcataaaat cttccaggag 1620 
agcactgtgc ttcatctttg cctcagggaa attcaacaac aaagagctgc tcaaaaattg 1680 
atctatacct tcaaccaagt gaaaccacaa accataccct acacaccaag gttcctggaa 1740 
gttttcttaa tctactgcca ttcagccaac cagtggttga ccattgagaa gtatatgaca 1800 
ggggagttcc ggaagtataa caacaacaat ggtgatgaaa tcacccccac caacaccctg 1860 
gaggagctga tgttggcttt ctctcactgg acctatgagt acactcgggg agagctgctg 1920 
gttttagatt tgcaaggtgt tggagaaaat ttgacagatc catctgttat aaaacctgaa 1980 
gtcaaacaat caagaggaat ggtgtttgga ccggccaatt tgggggaaga tgcaattaga 2040 
aacttcattg caaaacatca ttggaactcc tgctgccgga agctcaaact cccggattta 2100 
aaaagaaatg actactcccc tgaaaggata aattccacct ttggacttga gataaaaata 2160 
gaatcagctg aggagcctcc agcaagggag acgggtagaa attccccaga agatgatatg 2220 
caactataaa aagggaggag caagaagatc ccagtgcttg ccctgcctgc caggaactct 2280 
gtgataacat agattgatca acgtgatgtt gattacatca gcgtctcctt gggacacgcc 2340 
ttctgagcct cacatctcct tctgttcaaa ggcctcattg gtatatgatc aatgggttct 2400 
cctagacact gacctctgtc cagggcactt tgcagctcca tcctcaagtt ccacacgaag 2460 
atgcttggat gagtcagctg ggaatattgt tcttgtgtac ctcattgctt tagctggtca 2520 
cttggaactt tggagcagaa tcctgcacat taaaggatgg ggttgggggg gatacattta 2580 
ttttattttc tcactatgta tgcagactng accccctact actatttgtc acctcaccca 2640 
cagattgtat ttatgtctat atatatgttt cataaaaagc ttaccaaacc caaaaaaaaa 2700 
aaaaaaaaaa aaaaaaaaaa aa 2722 

<210> 37 
<211> 1924 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4787433CB1 

<400> 37 

cggcgcccag tgcacggccg caccatgggg tcccgccact tcgaggggat ttatgaccac 60 

gtggggcact tcggcagatt ccagagagtc ctctatttca tatgtgcctt ccagaacatc 120 

tcttgtggta ttcactactt ggcttctgtg ttcatgggag tcacccctca tcatgtctgc 180 

aggcccccag gcaatgtgag tcaggttgtt ttccataatc actctaattg gagtttggag 240 

gacaccgggg ccctgttgtc ttcaggccag aaagattatg ttacggtgca gttgcagaat 300 

ggtgagatct gggagctctc aaggtgtagc aggaataaga gggagaacac atcgagtttg 360 

ggctatgaat acactggcag taagaaagag tttccttgtg tggatggcta catatatgac 420 

cagaacacat ggaaaagcac tgcggtgacc cagtggaacc tggtctgtga ccgaaaatgg 480 

cttgcaatgc tgatccagcc cctatttatg tttggagtcc tactgggatc ggtgactttt 540 

ggctactttt ctgacaggct aggacgccgg gtggtcttgt gggccacaag cagtagcatg 600 

tttttgtttg gaatagcagc ggcgtttgca gttgattatt acaccttcat ggctgctcgc 660 

ttttttcttg ccatggttgc aagtggctat cttgtggtgg ggtttgtcta tgtgatggaa 720 

ttcattggca tgaagtctcg gacatgggcg tctgtccatt tgcattcctt ttttgcagtt 780 

ggaaccctgc tggtggcttt gacaggatac ttggtcagga cctggtggct ttaccagatg 840 

atcctctcca cagtgactgt cccctttatc ctgtgctgtt gggtgctccc agagacacct 900 

ttttggcttc tctcagaggg acgatatgaa gaagcacaaa aaatagttga catcatggcc 960 

aagtggaaca gggcaagctc ctgtaaactg tcagaacttt tatcactgga cctacaaggt 1020 

cctgttagta atagccccac tgaagttcag aagcacaacc tatcatatct gttttataac 1080 

tggagcatta cgaaaaggac acttaccgtt tggctaatct ggttcactgg aagtttggga 1140 

ttctactcgt tttccttgaa ttctgttaac ttaggaggca atgaatactt aaacctcttc 1200 

ctcctgggtg tagtggaaat tcccgcctac accttcgtgt gcatcgccat ggacaaggtc 1260 

gggaggagaa cagtcctggc ctactctctt ttctgcagtg cactggcctg tggtgtcgtt 1320 
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atggtgatcc cccagaaaca ttatattttg 
gccatcgggg cagcatttgg cctcatttat 

gtaagatcgc tggctgtggg aagcggcagc 

ccgttctctg tggacctcag cagcatttgg 

atggccctcc tgagtggagt gttaacacta 

gcaactactt gggaggaggc tgcaaaactg 

ttacttctca caactaataa tagtgggctg 

tctggtcttg gtgaataaat gtgccatgcc 

ctaatgcctt tgtattagag gaatcttatt 

tttaataaat tttgtaagaa aattttaaag 
gatg 



ggtgtggtga cagctatggt tggaaaattt 13 80 

ctttatacag ctgagctgta tccaaccatt 1440 

atggtgtgtc gcctggccag catcctggcg 1500 

atcttcatac cacagttgtt tgttgggact 1560 

aagcttccag aaacccttgg gaaacggcta 1620 

gagtcagaga atgaaagcaa gtcaagcaaa 1680 

gaaaaaacgg aagcgattac ccccagggat 1740 

tgctgtctag cacctgaaat attatttacc 1800 

ctcatctccc atatgttgtt tgtatgtctt 1860 

caaatatgtt ataaaagaaa taaaaactaa 1920 

1924 



<210> 38 

<211> 1797 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7483598CB1 



<400> 38 

agcatccccg 

aggctgccac 

ggggcgcccg 

gcgaggtggc 

ctgctgagga 

agagagattt 

gtcagtctgc 

tgccttattc 

catatgttac 

atacctacca 

ttcttcagtt 

ctttgagcaa 

gccacttcat 

atatagcaaa 

gaattgtaat 

gggtatttgc 

gccaccataa 

cccgccttat 

gtggatactt 

atgatgacct 

ctatggaatg 

catcggtttt 

tgctgattga 

tcatttttat 

ccgataagat 

tcgtcatggc 

ttcctgacaa 

tttctacttt 

tcatagactt 

aattattatt 



tcccggagga 
agcctagaga 
tggaactggc 
ggaggaggct 
ccaccgctca 
agatgacaga 
tgctcttttt 
aatgaagcaa 
agacttttcc 
gtctttggtc 
tttgtatcct 
agtttttcaa 
tattggactt 
gcttggaaag 
ggcaagggca 
aaagcccaat 
ctccttctta 
ccatatgtcc 
gacatttact 
ggtaacattt 
ctttgtgaca 
ccacattgtt 
ttgcctcggg 
cattccatca 
tatgtcttgt 
tattacaaat 
tttctctctc 
aaatattagt 
taaaacacat 
ttggctttta 



aaaaacattt 
tcttggggct 
tccctgcagc 
ggccgggtgc 
gccatgggct 
gaaacccttg 
aatgttgtca 
gctgggtttc 
cttgttttat 
aataaaactt 
tttatagcaa 
agaatcccag 
tccacagtta 
gtctccctca 
atttcactgg 
gccattcaag 
gtttacagtt 
atcgtgattt 
ggcttcaccc 
ggaagatttt 
agagaggtaa 
gtaacagtga 
atagttctag 
gcctgttatc 
gtcatgcttc 
actcaagact 
acaaatacct 
atctttcaat 
aacatttacg 
tcaagacttg 



gtctggcgaa 
tcagcccctc 
tctgcggcta 
gaatccgtac 
accagaggca 
tttctgaaca 
actcgattat 
ctttgggaat 
tgataaaagg 
tcggctttcc 
tgataagtta 
gagttgatcc 
cctttactct 
tctctacagg 
gtccacacat 
cggtcggggt 
ctctagaaga 
ctgtatttat 
aaggggactt 
gttatggtgt 
ttgccaatgt 
tggtcatcac 
aactcaatgg 
tgaaactgtc 
ccattggtgc 
gcacccatgg 
cagagtctca 
gagttgactg 
cttgctttag 
gcttttatga 



ctccgggtgg 
gcggcctgcc 
cacgcggacc 
ccagccccag 
ggagcctgtc 
tgagtataaa 
aggatctggt 
attgctttta 
aggggccctc 
agggtatctg 
caatataata 
tgaaaacgtg 
gcctttatcc 
tttaacaact 
accaaaaaca 
tatgtctttt 
acccacagta 
ctgtatattc 
atttgaaaat 
cactgtcatt 
gttttttggt 
tgtagccacg 
tgtgctctgt 
tgaagaacca 
tgtggtgatg 
gcaggaaatg 
tgttcagcag 
ctttaaaaat 
tctgtattta 
gtagtgcaat 



aaagcgcccc 
gagggagcag 
tcggctgtgt 
catcttccac 
atcccgccgc 
gagaaaacct 
ataataggat 
ttctgggttt 
tctggaacag 
ctcctctctg 
gctggagata 
tttattggtc 
ttgtaccgaa 
ctgattcttg 
gaagacgctt 
gcatttattt 
gctaagtggt 
tttgctacat 
tactgcagaa 
ttgacatacc 
gggaatcttt 
cttgtgtcat 
gcaactcccc 
aggacacact 
gtttttggat 
ttctactgct 
acaacacaac 
atgtatgttt 
tgttatataa 
ataaaaa 



<210> 39 

<211> 3277 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_ feature 
<223> Incyte ID No: 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1797 



7484823CB1 



<400> 39 
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atgggggaag gagatagtgg gttgaggagc ccaagaggcc cacaggaagg tgggaaatgc 60 
agtagggaaa aacagaagcg caatatggag gaactgaaga aggaagtggt catggatgat 120 
cacaaattaa ccttggaaga gctgagcacc aagtactccg tggacctgac aaagggccat 180 
agccaccaaa gggcaaagga aatcctgact cgaggtggac ccaatactgt taccccaccc 240 
cccaccactc cagaatgggt caaattctgt aagcaactgt tcggaggctt ctccctccta 300 
ctatggactg gggccattct ctgctttgtg gcctacagca tccagatata tttcaatgag 360 
gagcctacca aagacaacct ctacctgagc atcgtactgt ccgtcgtggt catcgtcact 420 
ggctgcttct cctattatca ggaggccaag agctccaaga tcatggagtc ttttaagaac 480 
atggtgcctc agcaagctct ggtaattcga ggaggagaga agatgcaaat taatgtacaa 540 
gaggtggtgt tgggagacct ggtggaaatc aagggtggag accgagtccc tgctgacctc 600 
cggcttatct ctgcacaagg atgtaaggtg gacaactcat ccttgactgg ggagtcagaa 660 
ccccagagcc gctcccctga cttcacccat gagaaccctc tggagacccg aaacatctgc 720 
ttcttttcca ccaactgtgt ggaaggaacc gcccggggta ttgtgattgc tacgggagac 780 
tccacagtga tgggcagaat tgcctccctg acgtcaggcc tggcggttgg ccagacacct 840 
atcgctgctg agatcgaaca cttcatccat ctgatcactg tggtggccgt cttccttggt 900 
gtcacttttt ttgcgctctc acttctcttg ggctatggtt ggctggaggc tatcattttt 960 
ctcattggca tcattgtggc caatgtgcct gaggggctgt tggccacagt cactgtgtgc 1020 
ctgaccctca cagccaagcg catggcgcgg aagaactgcc tggtgaagaa cctggaggcg 1080 
gtggagacgc tgggctccac gtccaccatc tgctcagaca agacgggcac cctcacccag 1140 
aaccgcatga ccgtcgccca catgtggttt gatatgaccg tgtatgaggc cgacaccact 1200 
gaagaacaga ctggaaaaac atttaccaag agctctgata cctggtttat gctggcccga 1260 
atcgctggcc tctgcaaccg ggctgacttt aaggctaatc aggagatcct gcccattgct 1320 
aagagggcca caacaggtga tgcttccgag tcagccctcc tcaagttcat cgagcagtct 13 80 
tacagctctg tggcggagat gagagagaaa aaccccaagg tggcagaggt tccctttaat 1440 
tctaccaaca agtaccagat gtccatccac cttcgggagg acagctccca gacccacgta 1500 
ctgatgatga agggtgctcc ggagaggatc ttggagtttt gttctacctt tcttctgaat 1560 
gggcaggagt actcaatgaa cgatgaaatg aaggaagcct tccaaaatgc ctatttagaa 1620 
ctgggaggtc tgggggaacg tgtgctaggc ttctgcttct tgaatctgcc tagcagcttc 1680 
tccaagggat tcccatttaa tacagatgaa ataaatttcc ccatggacaa cctttgtttt 1740 
gtgggcctca tatccatgat tgaccctccc cgagctgcag tgcctgatgc tgtgagcaag 1800 
tgtcgcagtg caggaattaa ggtgatcatg gtaacaggag atcatcccat tacagctaag 1860 
gccattgcca agggtgtggg catcatctca gaaggcactg agacggcaga ggaagtcgct 1920 
gcccggctta agatccctat cagcaaggtc gatgccagtg ctgccaaagc cattgtggtg 1980 
catggtgcag aactgaagga catacagtcc aagcagcttg atcagatcct ccagaaccac 2040 
cctgagatcg tgtttgctcg gacctcccct cagcagaagc tcatcattgt cgagggatgt 2100 
cagaggctgg gagccgttgt ggccgtgaca ggtgacgggg tgaacgactc ccctgcgctg 2160 
aagaaggctg acattggcat tgccatgggc atctctggct ctgacgtctc taagcaggca 2220 
gccgacatga tcctgctgga tgacaacttt gcctccatcg tcacgggggt ggaggagggc 2280 
cgcctgatct ttgacaacct gaagaaatcc atcatgtaca ccctgaccag caacatcccc 2340 
gagatcacgc ccttcctgat gttcatcatc ctcggtatac ccctgcctct gggaaccata 2400 
accatcctct gcattgatct cggcactgac atggtccctg ccatctcctt ggcttatgag 2460 
tcagctgaaa gcgacatcat gaagaggctt ccaaggaacc caaagacgga taatctggtg 2520 
aaccaccgtc tcattggcat ggcctatgga cagattggga tgatccaggc tctggctgga 2580 
ttctttacct actttgtaat cctggctgag aatggtttta ggcctgttga tctgctgggc 2640 
atccgcctcc actgggaaga taaatacttg aatgacctgg aggacagcta cggacagcag 2700 
tggacctatg agcaacgaaa agttgtggag ttcacatgcc aaacggcctt ttttgtcacc 2760 
atcgtggttg tgcagtgggc ggatctcatc atctccaaga ctcgccgcaa ctcacttttc 2820 
cagcagggca tgagaaacaa agtcttaata tttgggatcc tggaggagac actcttggct 2880 
gcatttctgt cctacactcc aggcatggac gtggccctgc gaatgtaccc actcaagata 2940 
acctggtggc tctgtgccat tccctacagt attctcatct tcgtctatga tgaaatcaga 3000 
aaactcctca tccgtcagca cccggatggc tgggtggaaa gggagacgta ctactaaact 3060 
cagcagatga agagcttcat gtgacacagg ggtgttgtga gagctgggat ggggccagag 3120 
attataagtt tgacacaaca tctgagacac taggatgaat tatcttggat gagaaagatg 3180 
ggcaatcctg ggctggcttg agggaatcat gggcagagga tgaggtgggc tgaagggaag 3240 
cccagcctgc atctagctgg agccccgcag ggagggg 3277 

<210> 40 

<211> 2773 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
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<223> Incyte ID No: 143935CB1 



<400> 40 

atggaagaga 

accacttcct 

gccctctact 

ttcagcgggc 

gggaagcact 

agccgccggg 

gtggaagagt 

cctgtgatca 

gtgaagttgc 

aatatgttga 

ctgttcataa 

cggaagattg 

aaggtggagt 

gcccaggatg 

cgctcccacc 

cacatcaagc 

cgccctcctc 

cgggcccctt 

tttggtattg 

tgaacaagct 

gctttctcgg 

tttgcaatat 

cttaagggtt 

tctgagctcc 

ctgtgtgttt 

gctttgggcc 

agcgtctaaa 

ccttttgcca 

taggagcaga 

tccagccccg 

ccctgtgaca 

ccttccccat 

catacccttt 

agggagcttc 

gtagagtggc 

ggtttgtgtc 

gcttgtgggg 

gacgggtgtt 

gagtcgtgac 

gaatgtgcgt 

gtattaacat 

gtttttacag 

ttgtatccaa 

tcgtgtattg 

tggattttat 

tgtttaagtg 

aagtgaatgt 



tgtcaggaga 

tcaagggcac 

ataccaccat 

gcatggaagt 

ttggtacgat 

agatcgagga 

gccaggcggc 

cctcatccaa 

tctacaacag 

aaaacattga 

aggatgttat 

ctgaagtctg 

ttcccgaagc 

gccggggacc 

acctggacga 

gccctgatga 

tcaccgcccc 

ctgcttccct 

cttgacaagg 

gtgtctaaga 

ttctcagagg 

ttatttttgt 

cagtttgaca 

ctgcttccta 

ggcccgggca 

tagagcttgt 

tcgttgagac 

ccttgggctg 

actttccatg 

tccagttgtt 

ggcatagctc 

gaagctgggc 

gtcctcccag 

cctgctataa 

cagcagtgaa 

tgtgttcatg 

cacaggtggc 

ctcagtgacc 

tgacaggggt 

caggtgatgg 

gggcagcctt 

cttagacttt 

cagcaaagtc 

ggcagatgct 

taagtgctaa 

ttaaacacca 

tgg 



aagtgtggtg 

gagccccagc 

gcagacgctg 

gctcaccgac 

actcaactac 

gctgctagca 

cctacaaaac 

ggaagaacaa 

aagtaacaac 

actgtttgat 

tggggatgaa 

ttgtacctcc 

ccggatttat 

tgacaatgcg 

ggacgaggag 

ccgggcccac 

cactccctgc 

tggagcctgg 

caaattgatt 

tctctacttt 

aaaagtatga 

gggtgtcgac 

attctgagag 

tttcataatc 

caatccccac 

tgataattgc 

cagtgcaact 

gtctttggcc 

gcagtaagtg 

ctcctgattg 

ataaactacc 

taactttcta 

cctggcatcc 

acacagaccc 

caggttgagg 

tttgtttaag 

ggcccctgct 

tacctcccag 

ggttgagact 

atgggtttgt 

tgacacatgt 

gtacttactg 

tacaataaaa 

ttttaaactg 

ggaaagaggg 

accctgtctt 



agctcagcgg 

tccaaatacg 

accaagcagg 

agtgaaggct 

cttcgagacg 

gaagccaagt 

aaagatactt 

aaacttatag 

aaatactcat 

aagctgtctc 

atctgctgct 

atcgtctatg 

gaggagaccc 

ctcctggagg 

cgggagcgga 

ctccaccagt 

cgtgctacac 

agatactttt 

gtcttgaccc 

tcatgagaat 

atgagtgtga 

ttcctatgtg 

ttgtcctgca 

ctccagcccc 

tgctttgcta 

agcttgtggc 

ttgggtgcaa 

tggtgctcac 

tccagctctg 

acccgactcc 

cctccctggg 

agtcattttg 

aggcagggac 

ccttgtcttt 

atgtgcgggt 

ggatatgtgt 

ggagcccggc 

gctcctctgc 

agactaggta 

caagggaatc 

attccaaaaa 

ccctgcctgt 

ctttaaaaca 

tcgtgtgaga 

cttacaaaat 

tcttttgggt 



tgccagcggc 

tgaagctgaa 

acaccatgct 

ggatcctcat 

gggcggtgcc 

actacctagt 

atgagccttt 

cgacttcaaa 

ataccagcaa 

tgcgctttaa 

ggtcctttta 

ccactgagaa 

tgaacatttt 

ccacaggcgg 

tcgagcgcgt 

gagcaggcaa 

ccagatcctg 

gtaacaagcc 

aggcgtatga 

ctgagactct 

agtgtatgtg 

ggctttttgg 

gttggaggcc 

agcaggtcca 

gacgtgcttt 

agtggaaata 

ggctttgttt 

tgggacccca 

tttctggttc 

actccaggaa 

atcccgctcc 

cttagaaatt 

accctcacac 

gcctctgatt 

agatagataa 

gactgtgggt 

tgggcgcagc 

acctgcaaag 

gagtagttac 

gttaccgttt 

cgagtttata 

gacagttgta 

atcatgactg 

aacttttata 

gtttcgtaaa 

tgagcttttt 



tgctacccgc 

tgtgggtgga 

gaaggccatg 

tgaccgctgt 

tttacccgag 

ccaaggcctg 

ctgcaaggtc 

taagccagcc 

ttctgacgac 

cggaagggtc 

tggtcagggc 

gaaacagacc 

gctgtatgag 

ergcggcgggg 

gcggaggatc 

gagaccgagc 

tgcaggctgc 

agatgattat 

cccctgtcgt 

ttggagccag 

agaacttttg 

gtgacactcc 

accagaggta 

ctcctggttc 

ctgccatgtg 

tggctgaatg 

agggatcaag 

tatgtctgcg 

tttccccaac 

ggccatctga 

tcttcagcct 

cagtgtggcc 

caccagcccc 

tttacacagt 

ctttgggtct 

ggggacgtgt 

gcctatgtag 

gaacaggagt 

caggagatgt- 

tataccaaag 

ttttcaaacg 

tgccttcatt 

aatgtcaaaa 

ttaggccatt 

tattttatac 

tagaaagtcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2773 



<210> 41 

<211> 2966 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5923789CB1 

<400> 41 

tttttccaag tgagagactg ccccatatgt tagtgcaata tgtcactgga ggtgaagcat 60 
cagttgtatt ggtgggaacc tgccgtttgc tgtccccttt ttcctcatgc cttttcctgc 120 
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ctctctgatc ttttctaggt ctctggccta tcaggaggac aactggtgct gcaatagaag 180 
ccagtggcta agtctcgtgt atggcgtggt taaggttgca gcctctcacc tctgccttcc 240 
tccattttgg gctggttacc tttgtgctct tcctgaatgg tcttcgagca gaggctggtg 300 
gctcagggga cgtgccaagc acagggcaga acaatgagtc ctgttcaggg tcatcggact 360 
gcaaggaggg tgtcatcctg ccaatctggt acccggagaa cccttccctt ggggacaaga 420 
ttgccagggt cattgtctat tttgtggccc tgatatacat gttccttggg gtgtccatca 480 
ttgctgaccg cttcatggca tctattgaag tcatcacctc tcaagagagg gaggtgacaa 540 
ttaagaaacc caatggagaa accagcacaa ccactattcg ggtctggaat gaaactgtct 600 
ccaacctgac ccttatggcc ctgggttcct ctgctcctga gatactcctc tctttaattg 660 
aggtgtgtgg tcatgggttc attgctggtg atctgggacc ttctaccatt gtagggagtg 720 
cagccttcaa catgttcatc atcattggca tctgtgtcta cgtgatccca gacggagaga 780 
ctcgcaagat caagcatcta cgagtcttct tcatcaccgc tgcttggagt atctttgcct 840 
acatctggct ctatatgatt ctggcagtct tctcccctgg tgtggtccag gtttgggaag 900 
gcctcctcac tctcttcttc tttccagtgt gtgtccttct ggcctgggtg gcagataaac 960 
gactgctctt ctacaaatac atgcacaaaa agtaccgcac agacaaacac cgaggaatta 1020 
tcatagagac agagggtgac caccctaagg gcattgagat ggatgggaaa atgatgaatt 1080 
cccattttct agatgggaac ctggtgcccc tggaagggaa ggaagtggat gagtcccgca 1140 
gagagatgat ccggattctc aaggatctga agcaaaaaca cccagagaag gacttagatc 1200 
agctggtgga gatggccaat tactatgctc tttcccacca acagaagagc cgtgccttct 1260 
accgtatcca agccactcgt atgatgactg gtgcaggcaa tatcctgaag aaacatgcag 1320 
cagaacaagc caagaaggcc tccagcatga gcgaggtgca caccgatgag cctgaggact 1380 
ttatttccaa ggtcttcttt gacccatgtt cttaccagtg cctggagaac tgtggggctg 1440 
tactcctgac agtggtgagg aaagggggag acatgtcaaa gaccatgtat gtggactaca 1500 
aaacagagga tggttctgcc aatgcagggg ctgactatga gttcacagag ggcacggtgg 1560 
ttctgaagcc aggagagacc cagaaggagt tctccgtggg cataattgat gacgacattt 1620 
ttgaggagga tgaacacttc tttgtaaggt tgagcaatgt ccgcatagag gaggagcagc 1680 
cagaggaggg gatgcctcca gcaatattca acagtcttcc cttgcctcgg gctgtcctag 1740 
cctccccttg tgtggccaca gttaccatct tggatgatga ccatgcaggc atcttcactt 1800 
ttgaatgtga tactattcat gtcagtgaga gtattggtgt tatggaggtc aaggttctgc 1860 
ggacatcagg tgcccggggt acagtcatcg tcccctttag gacagtagaa gggacagcca 1920 
agggtggcgg tgaggacttt gaagacacat atggggagtt ggaattcaag aatgatgaaa 1980 
ctgtgaaaac cataagggtt aaaatagtag atgaggagga atacgaaagg caagagaatt 2040 
tcttcattgc ccttggtgaa ccgaaatgga tggaacgtgg aatatcagat gtgacagaca 2100 
ggaagctgac tatggaagaa gaggaggcca agaggatagc agagatggga aagccagtat 2160 
tgggtgaaca ccccaaacta gaagtcatca ttgaagagtc ctatgagttc aagactacgg 2220 
tggacaaact gatcaagaag acaaacctgg ccttggttgt ggggacccat tcctggaggg 2280 
accagttcat ggaggccatc accgtcagtg cagcagggga tgaggatgag gatgaatccg 2340 
gggaggagag gctgccctcc tgctttgact acgtcatgca cttcctgact gtcttctgga 2400 
aggtgctgtt tgcctgtgtg ccccccacag agtactgcca cggctgggcc tgcttcgccg 2460 
tctccatcct catcattggc atgctcaccg ccatcattgg ggacctggcc tcgcacttcg 2520 
gctgcaccat tggtctcaaa gattcagtca cagctgttgt tttcgtggca tttggcacct 2580 
ctgtcccaga tacgtttgcc agcaaagctg ctgccctcca ggatgtatat gcagacgcct 2640 
ccattggcaa cgtgacgggc agcaacgccg tcaatgtctt cctgggcatc ggcctggcct 2700 
ggtccgtggc cgccatctac tgggctctgc agggacagga gttccacgtg tcggccggca 2760 
cactggcctt ctccgtcacc ctcttcacca tctttgcatt tgtctgcatc agcgtgctct 2820 
tgtaccgaag gcggccgcac ctgggagggg agcttggtgg cccccgtggc tgcaagctcg 2880 
ccacaacatg gctctttgtg agcctgtggc tcctctacat actctttgcc acactagagg 2940 
cctattgcta catcaagggg ttctaa 2966 

<210> 42 
<211> 4404 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6046484CB1 

<400> 42 

atgtatatcc gtgtatccta tgacaccaag ccagactcac tgctccatct catggtgaaa 60 

gattggcagc tggaactccc caagctctta atatctgtgc atggaggcct ccagaacttt 120 

gagatgcagc ccaagctgaa acaagtcttt gggaaaggcc tgatcaaggc tgctatgacc 180 

accggggcct ggatcttcac cgggggtgtc agcacaggtg ttatcagcca cgtaggggat 240 
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gccttgaaag accactcctc caagtccaga 
tggggcatcg tggagaataa ggaagacctg 
accatgtcca accctctaag taagctctct 
ctggctgaca atggcaccct gggcaagtat 
gaaaagcata tttcactcca gaagataaac 
gcactcatag tggaaggagg acccaatgtg 
acccctcccg tgccagtggt tgtctgtgat 
tttgggcata aatactcaga agaaggcgga 
ttggtgacta tacagaagac tttcacatac 
atcctcatgg agtgcatgaa gaagaaggaa 
ggacaccagg acattgattt ggctatcctg 
gccccagacc aactgagctt agctttagcc 
atctttattt acgggcaaca gtggccggtg 
ttagttctgg acagagtgga ttttgtgaaa 
cgttttctca ccatctccag actagaggaa 
acattgtacc acttggtcag ggatgtcaaa 
agcctgattg acatcggcct ggtgatcgag 
tacacgcgca agcgcttccg gaccctctac 
attcccttga ggcgaggaag aaagacaacc 
ttggatgatc ctgagatcaa ccacttcccc 
gttctcatga agcggcagaa gatggccctg 
gccaaggccc tggtggcctg caagctctgc 
gacatggttg acgacatttc ccaggagctg 
gctgtggagc tcctggacca gtcctacaag 
acgtatgagc tgaagaactg gagcaacgcc 
caccgcgact tcatcgcgca cacgtgcagc 
cggctccgca tgcgcaagaa ctcaggcctc 
accatcttgt ttttggaatt tcgcacatat 
aatgaggatg gcaaagaaaa agaagaggaa 
agaaaggggg atgaggagaa cgagcacaaa 
atctgtgaat tctataacgc gcccattgtc 
ggctacctgc tgctgtttaa ctacgtcatc 
caggagtgga tcgtcatctc ctacatcgtg 
gctactccaa aggcctcccc cagcccacta 
tactggaaca tcacagatct cgtggccatt 
ctacagaacc agccctacat gggctatggc 
tggtacatcc gtgtcctgga catctttggt 
atgattggaa agatggttag tagtggcatc 
atgagtttcg gagtagcccg tcaagccatt 
ctggcccgaa acatcttcta catgccctac 
cagatagacc gtaagagttt ttttctttca 
gagggcaagc ggcttcctcc ctgtatcccc 
tgctatctac tggtcgccaa catcctgctg 
accttctttg aagtaaaatc aatatccaac 
attatgacat ttcatgacag gccagtcctg 
tacatcatca ttatgcgtct cagcggccgc 
gaacgggatc gtggattgag tatgttcctt 
ttcgaggagc agtgcgtgca ggagcacttc 
agcgacgagc gcatccgggt cacttctgaa 
gaaatcaatg aaagagaaac ttttatgaaa 
gctcagctag aagaattatc taacagaatg 
gacaggtctg acctgatcca ggcacggtcc 
cttctccggc aaagcagcat caatagcgct 
aacggagaag agttattatt tgaggataca 
aggaaaaaaa cctgttcctt ccgtataaag 
ccagaatgtc agaacagtct tcacctttca 
ggcagtcacc ttgcagtaga tgacttaaag 
attgggattt caaaggaaga tgatgaaaga 
tccccaagtt taaataaaac agatgtgata 
actcagctaa cagtggaaac gacaaatata 
accaaaatta cacgctattt ccccgatgaa 
agaagcttcg tctattcccg gggaagaaag 
tacagttcaa tcacggacca gcaattgacg 



ggccgggttt gtgctatagg aattgctcca 3 00 
gttggaaagg atgtaacaag agtgtaccag 360 
gtgctcaaca actcccacac ccacttcatc 420 
ggcgccgagg tgaaacttcg aagacaactg 480 
acaagaatcg gtcaaggtgt tcctgtggtg 540 
atctcgattg ttttggagta ccttcgagac 600 
gggagtggac gggcatcgga catcctggcc 660 
ctgataaatg aatctttgag ggaccagctg 720 
actcgaaccc aagctcagca tctgttcatc 780 
ttgattacgg tatttcggat gggatcagaa 840 
acagctttac tcaaaggagc caatgcctcg 900 
tggaacagag tcgacatcgc tcgcagccag 960 
ggatctctgg agcaagccat gttggatgcc 1020 
ttactcatag agaatggagt aagcatgcac 1080 
ttgtacaata cgagacatgg gccctcaaat 1140 
aaggggaacc tgcccccaga ctacagaatc 1200 
tacctgatgg gcggggctta tcgctgcaac 1260 
cacaacctct tcggccccaa gagggatgat 1320 
aagaaacgtg aagaagaggt ggacattgac 1380 
ttccctttcc atgagctcat ggtgtgggct 1440 
ttcttctggc agcacggtga ggaggccatg 1500 
aaagccatgg ctcatgaggc ctctgagaac 1560 
aatcacaatt ccagagactt tggccagctg 1620 
caggacgaac agctggccat gaaactgctg 1680 
acgtgcctgc agcttgccgt ggctgccaaa 1740 
cagatgctgc tcaccgacat gtggatgggc 1800 
aaggtaattc tgggaattct tctacccccc 1860 
gatgatttct cgtatcaaac atccaaggaa 1920 
aatacggatg caaatgcaga tgctggctca 1980 
aaacagagaa gtattcccat cggaacaaag 2040 
aagttctggt tctttcagat atcatacttg 2100 
ctggtgcgga tggatggctg gccgtccctc 2160 
agcctggcgt tagagaagat acgagaggtg 2220 
gccagaaaat caaagtttgg cttccaggag 2280 
tccacattca tgattggagc aattcttcgc 2340 
cgggtgatct actgtgtgga tatcatcttc 2400 
gtcaacaagt atctggggcc atacgtgatg 2460 
ctgtgggtag tcatcatgct ggtcgtgctc 2520 
ctgcatccag aggagaagcc ctcttggaaa 2580 
tggatgatct atggagaggt gtttgcagac 2640 
gctccttgtg gtgagaacct atatgatgag 2700 
ggcgcctggc tcactccagc actcatggcg 2760 
gtgaacctgc tgattgctgt gttcagcaat 2820 
caggtgtgga agttccagcg atatcagctg 2880 
cccccaccga tgatcatttt aagccacatc 2940 
tgcaggaaaa agagagaagg ggaccaagag 3 000 
agcgacgagg agctaaagag gctgcatgag 3060 
cgggagaagg aggatgagca gcagtcgtcc 3120 
agggttgaaa atatgtcaat gaggttggaa 3180 
acttccctgc agactgttga ccttcgactt 3240 
gtgaatgctc ttgaaaatct tgcgggaatc 3300 
cgggcttctt ctgaatgtga ggcaacgtat 3360 
gatggctaca gcttgtatcg atatcatttt 3420 
tctctctcca cgtcaccagg gacaggagtc 3480 
gaagagaagg acgtgaaaac gcacctagtc 3540 
ctgggcacaa gcacatcagc aaccccagat 3600 
aacgctgaag agtcaaaatt aggtccagat 3660 
cagacagact ctaaaaaaga agaaactatt 3720 
catggacagg acaaatcaga tgttcaaaac 3780 
gaaggcacta tttcctatcc cctggaagaa 3840 
acgatcaatg cttgtaaaac aatgaagtcc 3900 
ctggtcggtg gggttaacca ggatgtagag 3960 
acggaatggc aatgccaagt tcaaaagatc 4020 
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acgcgctctc 
cataaagagc 
atccctcgct 
aagccagatc 
aggaatgtga 
agcttagtaa 
tccacagaaa 



atagcacaga 
agtttgcaga 
tgtccctaac 
aaactttggg 
aatccattca 
ttgtgtctgg 
ctgaatgcta 



tattccttac 
tatgcaagat 
cattactgac 
attcccatct 
gggaaagtta 
aatgacagca 
gtct 



attgtgtcgg 
gaacaccatg 
agaaatggga 
ctcaggtcaa 
gacagatctg 
gaagaaaaaa 



aagctgcagt 
tcgctgaagc 
tggaaaactt 
aaagtttaca 
gacatgccag 
aggttaagaa 



gcaagctgag 
aattcctcga 
actgtctgtg 
tggacatcct 
tagtgtaagc 
agagaaagct 



4080 
4140 
4200 
4260 
4320 
4380 
4404 



<210> 43 
<211> 669 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



7481427CB1 



<400> 43 

atgacggaac 

atctccaaga 

gccagcatgc 

atccccaaag 

tactccccca 

gcgggcgtgg 

ggcacggccg 

aagtcaggca 

tccgacggca 

taccaggcag 

aacacacaca 

ctctcctaa 



aagccatctc 
cagccgtggc 
cgatggccgc 
atcagggcgt 
cgcaagccct 
acaogcacac 
tcgtgt.at.cc 
cggagcgcga 
tccggggcct 
cctacttcag 
tcctggtgag 



cttcgccaag 
atcgatcaag 
cgccaagcag 
gctgtccttc 
caatttcgcc 
gcagctctgc 
cctggatttc 
gttccgaggc 
ttaccagggc 
ggtgtacgat 
ctggatgacc 



gacttcctag. 
cgggtccagc 
tgcaagggca 
tggaggggca 
ttcaaggata 
aggtactttg 
accagaaccc 
ctgggagact 
ttcagcgtct 
acggccaatg 
gcgcagaccg 



ctggaggcat 
tgctgctgca 
tcgtggactg 
acctggccaa 
agtacaagca 
cgggcaacct 
gcctggcagc 
gcctagtgaa 
ccgtgcaggc 
gcatgttccc 
tgacggccgt 



caccgccgct 60 
gatgcagcac 120 
catcgtccgc 180 
tgtcatccgc 240 
gatcttcctg 300 
ggcctctggc 360 
cgacgtggga 420 
gatcagcaag 480 
catcatcatc 540 
cgaccccaag 600 
ggctggcgtg 660 
669 



<210> 44 

<211> 1823 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



7483595CB1 



<400> 44 

aaaaaaaaac 

ttgaagcaga 

actgcactcc 

aggcttgttc 

ctagggcaag 

agcacttgtt 

ttagggaaag 

ccttcttttc 

tctggttttc 

ttaccttccg 

gtcccagctg 

cttctatgac 

agtatttcta 

tcagtatccc 

atctttggtg 

ttacagtgtc 

atttttcaga 

tggaggaagc 

tcaaatccct 

tcctggggtg 

aaagcataag 

tttcagcttg 



aaataaaaaa 
gaatcccttg 
ggcctgggca 
tgatcacctt 
aacttaagtg 
gggagcctct 
agagttacaa 
ctggagagtg 
tcttctctgc 
gagtctgcaa 
actttcatct 
ataattaatt 
actgttgttg 
attgagtttc 
agctcctttc 
aattccaacc 
tcttgtgtgt 
acaacagttt 
aacaccaaca 
aaaat-cttcc 
ctgttaaaag 
tttaattcaa 



tgcccagtgt 
tgcccaggag 
acagagtgag 
gtccctgttt 
agctgactgc 
gatcaacaat 
gagttaccaa 
ccattggtgc 
agctaataat 
tgagtgctta 
ttgggattat 
actgtgtagc 
ttgctctgcg 
ccatggaatt 
tgctcatatt 
aggatttaat 
ttactggtgc 
gcatctctgg 
tttatagaag 
agtgctgcag 
aggttgatat 
gtgacaccaa 



gagcatctgt 
tttgaggtta 
atcctgtctc 
tccccaagtc 
tcacaaagaa 
acaaggaccc 
aggggcctat 
cattgctgga 
gggcgtattg 
cctggctgct 
gattagtttc 
tctcccaaaa 
aatcaacaaa 
atttctgata 
tctgggcaag 
agccatcggc 
tattgctagg 
taggcgcaga 
catcaatgat 
ctcaattaca 
ggtaaaggtg 
tctacaagga 



agtcccagct 
cagtgagcta 
ataaataaat 
aacctagaga 
ggaaaggagt 
gtgttatcaa 
cttaactttt 
ttgaagacat 
ggtttgggct 
gtggcacttc 
catgccggtc 
gcgaattcca 
tgtatcagaa 
attttacaag 
aagattgcca 
ctttgcaatg 
actattatcc 
gctaagattc 
tatcgggaga 
tttgtaaatg 
cctcttaaag 
ggaaagattt 



atttgggaag 
tgattgcacc 
aaataggcaa 
gcagtagatt 
agtgtgagct 
cttctgttca 
ctatggaaac 
tctaccccac 
tcattgccac 
atatcatgct 
ccatctcctt 
ccagcattct 
tttctttcaa 
ccttctcctt 
gtcttcacaa 
tcgtcagttc 
aggataaatc 
ttctcctggg 
tcatcaccat 
tttactacct 
aagaagaaat 
gcaggtgttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
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ctgcaactgt gatgatctgg agccgctgcc caggattctt tacacagagc gatttgaaaa 13 80 

taaactggat cccgaagcat cctccgttaa cctgattcac tgctcacatt ttgagagcat 144 0 

gaacacaagc caaactgcat ccgaagacca agtgccatac acagtatcgt ccgtgtctca 1500 

gaaaaatcaa gggcaacagt atgaggaggt ggaggaagtt tggcttccta ataactcatc 1560 

aagaaacagc tcaccaggac tgcctgatgt ggcggaaagc caggggagga gatcactcat 1620 

cccttactca gatgcgtctc tactgcccag tgtccacacc atcatcctgg atttctccat 1680 

ggtacactac gtggattcac gggggttagt cgtattaaga caggtaagta ctgaggaggc 1740 

cttggcagga gctctgatcc ccctcctacc atcccaaccc caccctgatc ctgattgaat 1800 
tctgttagtc actgatcagc tgc 1823 

<210> 45 

<211> 2931 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> incyte ID No: 3788427CB1 

<400> 45 

atgtccatag ttcgtttatc cgtccatgcc aaatggatta tggggaaggt gactgggaca 60 

aaaatgcaaa agactgctaa agtgagagtg atcaggcttg ttctggatcc ccatttatta 120 

aagtattata ataagcaaaa aacgtacttt gctcacaatg cccttcagca gtgcacaatt 180 

ggggatatcg tgcttctcaa agctttacct gttccacgaa caaagcatgt gaaacatgaa 240 

ctggctgaga ttgttttcaa agttggaaaa ctcgtagatc ctgtgacagg aaaacctcgt 300 

gcaggaacta cctatctgga gagtccattg agttcggaaa ccacccaggg cgtagacggg 360 

gcttcccgtc cctcacgcgg gcccgcccca tgccgggcgg ggcctggggc ccgccgcctc 420 

cgcccgtggc cggagagccc caggcccgag ccacgcggtc ttccgggccc gggtcggggc 480 

tcaatggcta cgtggaggcg ggacggccga ctgacaggcg gccaaaggct gctgtgcgct 540 

gggctggcgg ggacgctcag cctcagcctc accgcgcccc tggagctcgc caccgtgctg 600 

gcccaggttg gcgtcgtgcg aggccacgcc cggggaccgt gggccacagg gcaccgggtg 660 

tggcgggcag aggggctccg ggccctgtgg aaggggaacg cggtggcgtg cctgcgcctc 720 

ttcccctgca gcgccgtgca gctcgccgcc taccgcaaat ttgttgtgct gttcacagat 780 

gacctgggcc acatttccca gtggagctcc atcatggctg ggagtctcgc aggcatggtt 840 

tccaccattg taacatatcc tacagacctc atcaaaaccc ggttgatcat gcagaacata 900 

ctggaaccat cgtacagggg gctcctccat gctttttcta ctatttacca acaggaaggg 960 

ttccttgccc tttatcgagg ggtttccctc actgttgtag gtgctctccc gttctctgct 1020 

ggctcccttc ttgtttacat gaacctggag aaaatctgga acggaccccg agatcagttc 1080 

tctctcccac agaactttgc taatgtctgt ctggctgctg cagtgaccca gaccctctcc 1140 

tttccctttg agaccgtgaa gagaaagatg caggctcaga gcccctacct cccacacagt 1200 

ggaggagtag atgtccattt ctcaggagca gtggactgct tccggcagat agtgaaggcc 1260 

cagggggtcc tggggctctg gaatggattg acagccaatt tactgaagat agttccatat 1320 

tttggaatta tgtttagcac ctttgagttc tgcaagagaa tctgtcttta tcaaaatggt 1380 

tacattctgt ctccactgag ctataaattg accccaggag tcgatcagag tttgcagccc 1440 

caggaattac gagaattaaa gaagttcttc aaaacgagaa aactgaagcc taaaaaacca 1500 

actctataaa atggaatgga actagaagtg cactgactga cagcatgttg caatcacaga 1560 

taaatgtagc ctcataactg tgcacctgag gagggatgag aagctactgc tatctgctgc 1620 

tctgggaaat gagatggttt ggccatgcac cacctcagat ctccaaactg atgtccctgt 1680 

gaacacctgc actatgagaa ggtttcccaa caagagtgta tcatgatggc attccacgag 1740 

attttataac cagagtctag cagtgataaa atgtacaatt tacgctctga atagaaacat 1800 

ggcaccaaag aaagggcatt gccatgacat ctcaaagcaa caggtaattg aagtggctga 1860 

gtggatacca cgttgtgagg tcagccactg ggtcacttgt agtcaatgca aagcttgcac 1920 

aaaccaacgt cagcctagtt tcactcagtt tgacctgaga ctttccaata caatctgttg 1980 
ttttgttttg tttttagaga cagggtcttg ctctgtcacc cagactggag tacaatgaca 2040 

caatcatagc tcactgcagc ctttaactac tgggctcaag acatcctcct gcttcagcct 2100 
ccagagagtt gggaccatag gtgcacacca ccacacctag ctaatttttg ggggaggtct 2160 

tgctatgttg cccaggctgg tcttgaactc ctgggctcat gcaatcctcc tgccttggcc 2220 
tcccaaagcg ctaggattag aggtgtgagc cgctgcaccc tgccccagta caatcttttt 2280 

tgaactcaaa tttttgctga catctgagtg cacacaccac agtgtaaatt atgccttatc 2340 
agaatctaaa tgaaaatagc gaacatttaa aagctatcac cattgtagta gaatcatcct 2400 
tcttttttga aatttgaagc atcccaggct taaaatcttg tgtttcagaa agacagttta 2460 
taccatgact gcttaattat ccccccaaag accttctgat tgaagtcatg tacagttcag 2520 
tggcctaaat tctctgcctt tttaacttgc tttgcaagcc tactctgaaa ataagttatt 2580 
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y tagtcaagtt attctcaaag atgtcccagt tgcctagaaa ggatcaaatg gaacatttga 2640 

cacacatact caaaaaaatg taactgacta taaacacttt aacctaatca tctgtatcaa 2700 

actttctaaa aatcaaatct caggattgtt ccactttaga gattctatgt aaagtttata 2760 

taactatact tgtcaaatag cacctatcta tgcatttaaa aatgcattaa tatctaacat 2820 

aaaattgtta atttatacac acacatacac ttatacatag agagagattt tcggtatgtt 2880 

gcaagtattt tttccgatta aatttatttt ccaaagaaaa aaaaaaaaaa a ^ 2931 

<210> 46 

<211> 1492 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6972455CB1 

<400> 46 

cttgctcctt gatcagtttg cggatggcct gcaaggccgc ttccggtacg gcttgggcaa 60 
tggaattcat aaaaagctca gggttcccga tagcgtgtgt gacccgttgt aaaactattc 120 
atagctggct gcaagccacg ccccaaaggc gctcgcggct ttttgcaaac gctcgaaaag 180 
ggtgcgggaa acctgaccac tgttcagacc agtaagacca ctgcacctca acaaacatca 240 
gccttgcaga cgatccgaac ggtgctggca tgggctgtgc actgggcaaa ctcagaaaat 300 
tccttcgcct ttttcggaga gctgccatga ccaagcgcta cagcgccctg ctcactgccc 360 
tgtttgccag cctgatgctg agccaggcac ccgcccaggc gagtggtctg gatgacatcg 420 
tcgcccgtgg caccctgaag gttgccgtcc cccaggactt cccgccgttt ggttcggtcg 480 
gccctgacat gcaaccccgg ggcctggaca tcgacaccgc caagctgctg gccgaccaac 540 
tcaaggtcaa gctggaactg acgccggtca acagcaccaa ccgcataccg ttcctcacca 600 
ctggcaaagt cgacctggtt atttccagcc tgggcaagaa ccccgagcgc gcaaaggtca 660 
tcgacttctc caacgcctac gcgccgttct accttgccgt gtttgggccg cctgatgccg 720 
ccatcgccag cctcgacgac ctcaagggca agaccatcag cgtgacccgt ggcgccatcg 780 
aagacatcga gctgaccgcg gtagcgccca aggaagccac gatcaagcgt tttgaagaca 840 
acaactccac catcgccgcc tacctggccg gccaggtcga cctgatcgcc agtggcaacg 900 
tggtcatggt ggccatcagc gaacgcaacc ccaagcgcgt accggcactg aaggtgaaac 960 
tcaaggattc gccggtgtac gtaggcgtga acaagaacga gccggcgctg ctggaaaagg 1020 
tcaaccagat cctggtggca gccaaggctg acggcagcct tgggaaaaac gcgatgcagt 1080 
ggctcaaaga accgctgcct gccgacctct gacgcggagc tcaaagatgg cctatcaatt 1140 
cgactttgtg ccggtgctgg ccaataccga cctgctcctg cgtggcgcgc tgttcaccct 1200 
tgagctgacg gccattggcg ccatcctcgg cgtggccctg ggcactgtcg gcgccgtggt 1260 
acgggcgtgg aagatccaac cgtttgcgtg ggtctttggt gtgtatgtcg agttgatccg 1320 
caacacgccg ttcctggtgc aactgttctt catcttcttc ggcctgccat ccctcgggtt 1380 
gaaaattacc gagtggcaag ccgccgtgct ggcgatggtg atcaacctgg gggcctactc 1440 
caccgagatc atccgtgccg gcatccaggc catcccacgg gggcaactgg ag 1492 

<210> 47 

<211> 2406 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 8077668CB1 

<400> 47 

atgtatgacc caccaaaagt gaataagaaa aatgactctg ctgatcattc tgatgaatct 60 
gtggaaatac agtcagccct tcatatctgt ggatttgacc aaggcaggtg gatcctctcc 120 
taccgccggt cacccgcgcg ccggcacacg ggcctggccc cagtacttgg ggtgcgggtc 180 
cgcgtctgcg tggggtgcca aggggcccgg gtcccagcgt ggcccaaacg tgagggtagg 240 
ggctccgcgg aggagccagt agagagaaag gatggacaac aaggggagag gagaggagaa 300 
gccagcctga ccaaggatgc tgttaccttc cagagcagtc tccctgtcca gatcaccagg 360 
atccctgctt gctggagaat gacccagatg tcccaggtgc aggagctctt ccatgaggca 420 
gcccagcagg atgccctggc ccagccccag ccctggtgga agacccagct gttcatgtgg 480 
gagcctgtgc tgtttgggac ctgggatggt gtgttcacat cctgcatgat caacatcttt 540 
ggggttgtgc tcttcctgag gactggctgg ctggtgggaa acacaggagt gctcctgggc 600 
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atgttcctgg 
gtcggggagc 
ggtgggcaga 
ggctttgctg 
atttcagttg 
atccgcctcc 
ggttctttca 
cagaacaaca 
ttcttcccag 
cctgccgcca 
atcggctaca 
cagcattccc 
cagtctgagc 
tctgagagtg 
ggtgggctgg 
gggaatgctg 
ttcctcggcc 
aaccaccgag 
cgaattgggg 
tgagttctct 
ttcctagatt 
cttcccccag 
tcacactgtg 
gaatattgtt 
gaagaggcaa 
tattctaacc 
tcttagctta 
gtgcaataac 
aataaaagaa 
ttccaggagg 
gaacct 



tgtccttcgt 
gcagcagcat 
cgggaggcac 
aatccatctc 
cggtgcttct 
agctgctgtt 
cccacctgga 
cgctgcccga 
cggctacagg 
gcattcccct 
ggagtcatgg 
ctgggctccc 
tagagagtgg 
ccccgactgt 
aaaattcacg 
ccgaggagga 
tcccgccccc 
tttccaggat 
ctgcatctca 
gagcctccat 
gcggcgggga 
tgagcctccc 
ccacccacac 
tctctctgta 
agagatcaga 
acagtatt ta 
ctgcagtgaa 
attgggaaga 
agaggct tct 
aacccttggc 



catcctggtg 
cggcagcggt 
catcgggctg 
ggatttgctg 
ggccttgctg 
gctgttcctg 
cccagaacat 
ttacagcccg 
agtcatggcc 
gggctccctg 
ccggcttcaa 
tggcagctgt 
gagagaaaac 
gaccaatcta 
tctggtgggt 
gtgtcgggac 
gtccctcccc 
cactggaggg 
gctctgccgc 
ttcctttctt 
gagaaggcac 
ctccagctcc 
aagcaggctg 
ccacaggaag 
gaccagaaac 
gaaaatatta 
accattgcta 
aaatgaagac 
gaatgaggac 
ccaccctctg 



gccctcgtca 
ggcgtctact 
ctctatgtgt 
ggcctcggga 
ggcattaacc 
ctggccgtgt 
ggtttcattg 
ggggaatctt 
ggcttcaaca 
gcagctgttg 
catggggggc 
tggcatctcg 
cccaggcaga 
tttcatattc 
tcagacactg 
ccagatcaca 
cacccccatt 
agcataaaga 
tgattggctg 
gtgaaatggg 
acagtggaag 
cccaggccgt 
ggtaaccttg 
aagcttctct 
aatgacttga 
atgtatctgt 
tcagcctggc 
aacagagata 
cctgtcttgt 
attacccaga 



cggtgctgtc 
ccatgatctc 
ttggacagat 
atatctgggc 
tcgcaggtgt 
ccacactgga 
gatattcacc 
ttttcactgt 
tggggggcga 
gcatctcgtg 
gacctcaggg 
taagttcccc 
agagggattg 
cgatccttct 
ccctttcatg 
gatcgaatcc 
ttgaacttgg 
tgaagagttt 
catgactctg 
gataatgaca 
ctcagcaccc 
caggcctctg 
ggtctatttc 
gcccttcatt 
gaagttaaaa 
ttcacaaaca 
gtacagcatc 
gagttgagtg 
ttctggcccc 
tgggaatgag 



tgacattggc 
ctcggtcctg 
gtatatcacc 
tgtgcgagga 
caaatggata 
ctttgtggtg 
cgaactgcta 
ctttggggtt 
cctcagggag 
gtttctgtac 
agcctgccgc 
tcatggcaat 
ccccttttgt 
tttaacactc 
cgccctgcct 
tttctttcct 
ggattttttt 
tgcaatcaga 
gacaatttac 
ctgtatgtgc 
agctcttctt 
gggttttagt 
tgacctttta 
gggtaggtgt 
gggcctgtgt 
ctgaatttgt 
agggtatttt 
ttgggaagga 
agctgggagg 
gagaccacag 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2406 



<210> 48 

<211> 3686 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 55120485CB1 

<400> 48 

atggaagaga acagtaagaa ggaccatcgg gctttgctca accagggaga ggaggatgaa 60 
ctggaggtgt ttggttaccg ggaccacaat gtacggaaag ccttctgcct tgtcgcatcc 120 
gtgctgacct gtgggggcct tctgctggtg ttctactgga gaccccagtg gagagtgtgg 180 
gccaactgca tcccatgccc cttgcaagaa gcagacactg ttttgctgag gacaacagac 240 
gaatttcaaa gatatatgag gaagaaggta ttctgcctct acttatacac actgaagttt 300 
cctgtaagca agaagtggga agaatccctg gtggctgacc gccactctgt cataaaccaa 360 
gccttaataa agccagaatt aaaactgcgg tgcctggaag tgcagaaaat caggtatgtt 420 
tggaacgacc tggagaagcg gtttcagaaa gttgggttgc tagaagacag caattcctgc 4 80 
tctgacatcc atcagacatt tggattgggt ctgaccagtg aagagcaaga ggtcagaaga 540 
ttagtgtgtg ggcccaacgc cattgaggtt gaaatccaac ccatatggaa gctgcttgtt 600 
aaacaggttt taaatccatt ctatgtgttc caagccttca ccctaacttt gtggctgtct 660 
caaggttaca tagaatactc tgtggccatc atcattttga ctgttatctc cattgtctta 720 
agtgtgtatg atttgcgaca gcaatcagtt aagctgcata acctcgtgga ggaccacaac 780 
aaagtccagg ttacaatcat tgtaaaagac aaaggtttgg aggagctgga atcccgtctc 840 
ttggttcccg gagacattct tattcttcca ggaaaatttt cattgccatg tgatgctgtt 900 
ttgattgatg gaagctgcgt ggtgaatgaa ggcatgctta caggagaaag tatacctgtt 960 
acaaagacac cattgcccca gatggagaac actatgcctt ggaaatgtca cagtttggag 1020 
gattatagga aacacgtcct tttctgtgga acagaagtta tccaggtcaa gccctctggg 1080 
caggggcctg tacgagcagt cgttttgcaa acaggttaca atacagccaa aggggactta 1140 
gtgagatcca tcctgtaccc ccggcctctg aacttcaaac tatacagcga tgccttcaag 1200 
ttcatcgtgt tcctggcctg ccttggtgtc atgggttttt tctatgccct aggggtatat 1260 
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BNSOOCIO. <WO 024641 5A2JA> 



atgtaccatg gagttcctcc aaaagatact 
actgtccctc cagtgctgcc agctgccctg 
ctgaagaaaa agaaaatctt ctgtatctcc 
aacctcgtgt gctttgacaa aactggcact 
actgtcccta ctgctgacaa ctgcttccag 
gtgccatgga gcccactgtg tgcggccatg 
gggaccatcc agggagaccc tctggacctc 
gaagattgca ttgtagactc ctgcaaattt 
ggaccaaaag ccagtaagag tccagtggaa 
tcctcgagcc tgcagaggat gtccgtgatc 
gtctacatga aaggtgcccc agaaatggtg 
aagaatttcc cacaggaact gaggagttac 
gcccacaaaa ccttaaagat ggggaatctt 
gtggagtcag agttaacatt tctgggactt 
accaaactgg tcttgaagga actgagtgag 
gataaccttc aaacggccat tactgttgca 
caagtgatca ttgttgaggc cgatgaacca 
cagctggtgg agaaccaaga gactggacct 
aacagttcaa cccctcgtgg ggaaggagga 
tcataccaag tgatatttca gcatttcaac 
acagtttttg caagaatgtc tcctgggcag 
ttaaatgcct gcacagtaca aaatgaaagc 
ccagaaaaaa tggaaagtaa tagcaccttc 
ttgggaacaa tcaactgtat cactgtggct 
cagccaactt atacaaacta tatatttgtc 
ctattcattc tatttgctga tataccagaa 
cccgtcctgt ggagggcctc cattgtcatc 
gtggccgagg aggctgttat tgaaaatcga 
ggctatcagt caaaaagcca gtatcggata 
tggcccccgc taaaccaaac ctcccactct 
tacagcaatc cagtatttga gagcaatgaa 
gttgatatta cagagaaaca atgacaaaag 
actcttgtaa catcagctgg agttttgggg 
gtctgtttga taaaatactg tctacagaga 
ttggcttctg aaggtattag aggcattcag 
gggccaagag acaagcaggc taaaggggaa 
caatactttc ctttttataa aaatgagttg 
gttgtaattt ttgaattttt acttcatcga 
ccaatcttta ctgtctgggt tccttttctc 
ttctgtttct ggccttaccc aacatattgt 
aaaacaactc tcaaaaaaaa aaaaaa 



gtgaccatgg ccctgatcct cctcaccgtg 1320 
accataggca acgtgtatgc tcagaagaga 1380 
ccacagagaa tcaacatgtg tgggcaaata 1440 
ctcactgaag atgggctgga cctctggggg 1500 
gaagcccaca gctttgcctc aggccaggct 1560 
gccagctgcc actctctgat ccttctcaat 1620 
aaaatgtttg agggcactgc ctggaaaatg 1680 
gggacgtcag tttcaaacat cataaaacca 1740 
gccatcatca ccttgtgcca gtttccattt 1800 
gctcagctag ctggggagaa tcatttccat 1860 
gccaggttct gcagatctga aacagtgccc 1920 
acggtgcaag gcttccgtgt cattgctctt 1980 
tcagaagtcg agcacttagc cagagaaaaa 2040 
ctcatcatgg agaatcgctt gaaaaaagaa 2100 
gcccgtatca ggactgtgat gattacaggt 2160 
aagaattctg aaatgatccc tccaggcagc 2220 
gaagaatttg ttcctgcctc tgtgacctgg 2280 
gggaagaaag aaatctacat gcatactgga 2340 
agctgttacc attttgcaat gagtgggaaa 2400 
agcttgcttc caaaaattct ggtgaatgga 2460 
aaatcaagcc ttattgaaga atttcagaaa 2520 
atctcagagt taaccatgtc tccaactgct 2580 
acaagttttg agaacactac agtctggttc 2640 
cttgtgttct ctaaaggaaa accatttaga 2700 
cttgtgctga taatacagct tggtgtatgt 2760 
ttatatagac gtttggatct gctctgcact 2820 
atgctcagct tgaatttcat tgtgtccctt 2880 
gccctgtgga tgatgattaa aagatgtttc 2940 
tggcagaggg acttggcaaa tgaccctagt 3000 
gacatgccgg agtgtggcag aggagtgtct 3060 
gaacaactct gaaggacatg tgatatccaa 3120 
ggaccagttt cattgatttt aagaaatgag 3180 
ctacctatca aatcatggtg acaaactaaa 3240 
agaacattga cccagaagag tgctttcctc 3300 
tgaaggcaag aatgggccaa gaatttctat 3360 
aaatatgtga tctttcctct taggacagga 3420 
aggggaagta gtactgtgta tactcacgga 3480 
tggaaaaact aatgcgtggg aaacattaca 3540 
tgtagtactt ttcagatttg aggctttttc 3600 
taaggataag cttagcctat gctgcaataa 3660 

3686 



<210> 49 
<211> 2807 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3112883CB1 

<400> 49 

aagaattcgg cacgaggcca cagccgcggg 
gtgaacagac actcccttct gtagccatgg 
ggatttaccc cactgtgatc ctctgcttat 
aaccattcct tatcccatat ttatctggac 
caaatgagat cttccccgtt tggacatact 
tcctcaccga ttatgtccgc tacaagccag 
ttacctggct gctgctgttg tttggccaag 
tctatgggat ggtcaccgcc gccgaggtgg 
gccccgagca ctaccagaga gtgagcggct 
cagcagggtc ggtigctggct caactcttgg 
tcaacgtcat atccttggcc tctgtctccg 



tgcgtggtcg tgcggcaagt gagcgatttg 60 
attgttacag aacttcacta agcagttcct 120 
ttggtttttt ctccatgatg agaccctcag 180 
cagataaaaa cctgaccagt gcagagataa 240 
cctacctggt gctgctgctg cctgtgtttg 300 
tcatcatctt gcaaggtatc agtttcatca 360 
gagtgaagac catgcaggtt gtagagttct 420 
cctactacgc ctacatatac agcgtggtca 480 
actgcaggag cgtcacgctg gccgcctaca 540 
tatccctggc gaacatgtcg tacttttacc 600 
tggctttcct tttctcactt ttcctaccaa 660 
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tgcccaagaa aagcatgttt tttcatgcaa 
gcgtgaatcc agtattagag gaaactcacg 
aacccacatc agaaatactc agcacttcag 
tgaaaccaag caatgtgact gtggacgttt 
gctactcctc aaaacgtctt ttctactggt 
ttaaccaggt tttgaactat gttcaaatcc 
cttccatcta taatggggcc gtagaagcta 
ttgcagtggg ttatgtgaaa gtcaactggg 
tctcagttgt caatgccggt tctttatttc 
gctatgctgg ctatttgata ttcaagtcca 
ttcagattgc agttaatctg aatgtggaac 
ttattgcctt ggtgattcag accatcatga 
acttgccagt cagcattcag tttttagttt 
ttttcctaat gagaagcatg tatattacct 
gccctgctcc aagtgagaat ccagatgtgt 
tgtcaacaaa actctaacct catcgcaaca 
taataaggaa agattttgag atgggtggca 
aaatctggat tccaatggac ctttcaaaac 
tctatgtgac caa'ttttact ggatgcaatt 
catattgggg accccctgtg actagtagca 
aagacatgca gatggcgtgg aaccaaacca 
attcaccatt tgtgaggaga aatactaact 
tatttgatct taccaaagaa gtcaacacat 
ttcctcttct attaagtgta tttctgttta 
ttgtcccatt tctaagggct tctcttcaac 
tgatagtatt attatttaat ttttttatga 
tgttgaaagt ctccctcaag aatagcttca 
ggtcatttcc cagtggcctg ggagaggcag 
gttaatgcat ctatgggcca ggtgcagtgg 
gccaaggtgg gagttttgct tgaggctggt 
gcctcggccc cccaaagagc tgagattaca 
ctgattttta aatggtagca attgtgctaa 
gttaacattt tttgatgaat tctgctgaaa 
tcaagaaact ttcaataagg gatgcattga 
gtcagatttt attaggctgg acttgtggtt 
gtgagaggta attgaggttg gggtttaaac 

<210> 50 

<211> 2170 

<212> DNA 

<213> Homo sapiens 



aacccagcag agaaataaag aagtcatcaa 720 
aaggtgaagc accaggctgt gaagagcaga 780 
ggaagctgaa taagggccag ctgaacagcc 840 
ttgtgcagtg gttccaagat ttgaaggagt 900 
ctctatggtg ggctttcgcc acagcaggtt 960 
tgtgggatta caaggcgcca tcccaagatt 1020 
ttgcaacctt tggaggggct gtggctgcct 1080 
accttctggg agagctggct ctggtggtct 1140 
tcatgcatta cacagccaat atctgggcgt 1200 
gctatatgct tcttataacc atagcagtat 1260 
gctatgcctt ggtatttgga atcaacacct 1320 
ctgtgattgt agtagatcag agagggctca 13 80 
atgggagcta ttttgcagta attgctggaa 1440 
actcaaccaa atcccagaag gatgtacaga 1500 
ctcacccaga ggaagagagt aatatcatca 1560 
aacgcaacag tggctttcaa agttatgcaa 1620 
tatgttttgc cataacttga catgcttztgc 1680 
cacaacaaaa cctcagtttt agatgagttc 1740 
gacaggaccc gtcatcataa ttaaacaacc 1800 
gctggaaaat tctggttttt atcacttgta 1860 
tgagaaaact ccagccatcc tggagttgat 1920 
ggactgaccc tattgctagg cttaaatact 1980 
gggacctttg tgtcacatga accattttct 2040 
agttacagtt ctctaagaga attacaatgt 2100 
tctaataaca gcattattca cgttatgatt 2160 
ttattttcca ttttgtgctc tgagttttgc 2220 
gatcctcttg tgtatttgca gaatacacaa 2280 
tgagccttct ctccaccacc atagacaggt 2340 
ctcactcctg taatccgagc aatttgggag 2400 
ctcaaactcc tgacttcagg tgatctgtct 2460 
ggcgtgagcc accatgcctg gccaaaacac 2520 
ataacatgtg tcaacaattt tcaaagagtt 2580 
attatctgaa catacctgtt tttcgaaact 2640 
gccgtctgac agtgagaata acgatgctga 2700 
acaactttat cccgcatttt gggaagctaa 2760 
ccttggcatt ttaacct 2807 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 4253888CB1 



<400> 50 

gtgaatgccc cttccatccc ccgcccctgg tgtctcggtc tggtctgagg agacggggac 60 

ccttctcaca ccggccgctg ccgccgccgc gctccggaac agatccagtc cttctgtgga 120 

acttctgaac atcttttatt agtggaaata ttttctacac aatgaagtca acaacttaat 180 

ttaaaccagt gtttgtgcgg ttctgattca tctgctgtgg ttcccgaagc ttgagatcta 240 

aggagtacag ggtcttttgt gatgacaata tgactaatag taaaggaaga tctattaccg 300 

ataaaacaag tggtggtcca agtagtggag gaggttttgt agattggact ttacgtttaa 360 

acacaattca atccgacaag tttttaaatt tactcttgag tatggttcca gtgatttacc 420 

agaaaaacca agaagacagg cacaaaaaac caaacggcat ttggcaagat ggattatcaa 480 

ctgcagtaca gacttttagt aatagatctg agcaacacat ggagtatcac agtttctcag 540 

agcagtcttt tcatgccaat aatgggcacg catcatcaag ctgcagccaa aagtatgatg 600 

actatgccaa ttataattac tgtgatggaa gggagacttc agaaaccact gccatgttac 660 

aagatgaaga tatatctagt gatggtgatg aagatgctat tgtagaagtg accccaaaat 720 

taccaaagga atccagtggc atcatggcat tgcaaatact tgtgcccttt ttgctagctg 780 

gttttggaac agtttcagct ggcatggtac tggatatagt acagcactgg gaggtgttca 840 

gaaaagttac agaagttttc attttagtcc ctgcacttct tggtctcaaa gggaacttgg 900 

aaatgacatt ggcatccaga ttatccactg cagtaaatat tgggaagatg gattcaccca 960 
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ttgaaaagtg 
tgggttttct 
accttgatca 
ttctgcaggg 
ctgataatgt 
tggcttggat 
tagttggtgt 
cagccacaag 
gtagcattgg 
ttgtttacac 
ggatttctac 
gttgttacta 
ttctactgct 
aaagtggtca 
tacaggtatt 
gaaaggaccc 
cagctctgtt 
ttggagacta 
accacttatg 
cttcagttgg 
aaaaaaaaaa 



gaacctaata 
agcagctgtg 
ttccatactt 
aataataatg 
tgctacaccc 
aagtcagggc 
atttttcttg 
aacagttctc 
gggccttatt 
gccagttatt 
ctacctccat 
cccatttaga 
tttagtgatt 
tacttcttta 
taccttgctg 
ggatagtttc 
agccttaagt 
ataaattcta 
gctctttttc 
ccctgattca 



attggcaact 
gcagcaatta 
ctgtgctcta 
gttggggtta 
attgctgcta 
ttatactcct 
gctctaaccc 
cactcaggct 
ctggacacaa 
aatggtattg 
ttacatagca 
actttctttg 
cctggacatt 
actataatct 
tggattgctg 
tccatcccct 
tttcattttc 
caaactgctc 
aaaactctta 
attaaatggc 



tggctttaaa 
tattgggctg 
gcagtgtggc 
tcgttggttc 
gttttggcga 
gtcttgagac 
ctatttggat 
gggagcctgt 
ctgtatcaga 
gtggtaattt 
ttccaggaga 
gtccaggagt 
taattttcct 
tcatagtagt 
actggatggt 
acctaacagc 
tttggcttat 
tcaagttacc 
aatcagtagt 
cttaattttt 



gcaggttcag 
gattccagaa 
aactgccttc 
aaagaagact 
ccttataact 
ctattactac 
tataatagct 
cataacagct 
cccaaacttg 
ggtggccatt 
attgcctgat 
aaataataag 
ctacactatt 
gtatttattt 
ccatcacttc 
attgggtgat 
tggagatcga 
aaggaagaaa 
ttgacttttg 
ttttaaggaa 



gcaacagtag 
ggaaaatatt 
attgcatctc 
ggtataaatc 
cttgccatat 
atttctccat 
gccaaacatc 
atggttataa 
gttgggattg 
caggctagca 
gaacccaaag 
tctgctcaag 
catttgatga 
ggcgctgtgt 
tggaggaaag 
ctgctcggga 
gatggagatg 
atacacgaca 
ccagggtaat 
tttgtgtcaa 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2170 



<210> 51 

<211> 1722 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



7479974CB1 



<400> 51 

atggatgctg ttaaatactt aaataaattg 
ttgtttttta ctggtaaagg tggcgtaggc 
aacttagcag agaatggaaa gaaagtagct 
caagatgtat ttcaaatgga attatctaat 
ctctctatag ccaatttcga cccgattgtt 
gaaccttatg agggtattct accagaagat 
ggttcatgta cagttgaagt agcagcattt 
actttagaac aagaatttga tttcattata 
agaatgcttg aattaccttc tgcatggaca 
tcttgcttag gtcaattatc aggtttaaat 
gaaaaactac gtaaccaaga tgatacgacc 
tctatatatg aaattcaaag agcgcaacaa 
aaagtaatca ttaacaacta tatagaagaa 
tcggaacaag ataaaaacat taatcatttt 
tacgttccat ataaaaagca gaaagaagaa 
gatgataact taattgaaaa tgatgacttt 
ttaatcgatg aaattgaaaa tagtaaagtt 
gttggtaaaa cgacagtagc aacgcaatta 
gttcttttag caactactga ccctactaaa 
aatactgctt atattgatga agaacaagca 
acagtgaatg atgatacacc acaagacgat 
ccttgtacag aagaaatagc atttttcaaa 
gacatggatt acgtcattgt agacacagct 
tctagtgaaa atcatcatag agaattaaag 
gaaacattat tacctaaaat tcaaaataaa 
gcagaaaaaa caccttattt agaatctaaa 
ataggccata attggtgggt tgttaatcaa 
ctttttagta acaaaaaaga agatgaatca 
tttgacaatt attttgtcat accttatggg 

<210> 52 



aatttagata acattgagtt aacaaaatat 60 
aaaacaacga tatcaagttt tattgcttta 120 
ttagtaagta ctgatccagc tagtaattta 180 
aaattaacta aatatcaacc tatacctaat 240 
gctgcagacg attataaagc acaatctata 300 
gtgcttgctg aaatgaaaga acagttaagt 360 
aatgaattta caaatttttt atccgataaa 420 
tttgatacag ctcccacagg tcacactttg 480 
gattatttaa atacaacgag taatgacgct 540 
gaaaatagag ttaaatataa ttcagcactt 600 
atgatgttag ttgcgagacc tactcactct 660 
gaattacaac aactgtcaat ttctaaattc 720 
agtcacggtt taatttcgag tcagatgaaa 780 
actgaatggt taaataacaa tcatgcttat 840 
ggtatagaaa atttaactaa tctattaaat 900 
attgttgaag atcatccgca attcaataaa 960 
caatatttat ttacaatggg aaaaggtggc 1020 
gctacagcat tatctaataa aggatatcgt 1080 
gaaattaatg ttgaaactac aagtaattta 1140 
ttagaaaagt ataaaaaaga agtactagcc 1200 
attgattata ttatggaaga tttaaaatca 1260 
gcctttagtg acattatgga gaatcaagac 1320 
cctacaggcc ataccttgct gttacttgat 1380 
aaaaaatcaa ctcaaactac cagtaatgtt 1440 
aatttaacac agatgataat cgtaacacta 1500 
cgtttagtag aagatttaaa tagagctaat 1560 
tcgttagtta cgctaaatca acgtgatgac 1620 
ttttggataa acaagattaa aaatgaaagt 1680 
gggttatcat aa 1722 
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<211> 1424 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: 7483850CB1 

<400> 52 

ctgatgtcta tttaggaaga aacaaactgg tgtatatttg cttgtttcta cattttctag 60 
gcactgcttt gttatctgtg gtggcttttc ccttggaaga tttctatctg ggcacttacc 120 
atgcagttaa caacatacca aaaacagagc agcacaggct gttttatgta gcactgctga 180 
ccatttgcct tggcattgga ggcgtaagag ccatcgtctg tccactgggt gcttttggcc 240 
ttcaggagta tggatcacaa aaaacgatgt ctttttttaa ctgattcctt caggatatta 300 
tctgcaaact atgaattcca acctgaattt ggatggattt cttctgccga ttgcagtaat 360 
gaatgccatc agcagcctac cattactaat tctggctcct tttctggagt atttcagcac 420 
ctgcctgttt ccctctaaga gagttggatc atttctgtca acatgcatca ttgctggaaa 480 
tctttttgct gcattgtctg tgatgatagc tggcttcttt gaaatacacc gaaaacattt 540 
ccctgcagtg gagcagcccc tttcaggaaa agttctcact gtttcctcca tgccctgttt 600 
ctacctgatt cttcagtatg ttttacttgg agtggcggaa acactggtaa acccagccct 660 
ctctgtaata tcatacagat ttgttccaag caatgtcaga ggaacctcca tgaattttct 72 0 
gacactgttc aatggatttg gctgtttcac aggggcactg ctggtgaagt tggtatatct 780 
catctcagaa ggtaaaaata ggcaatggtt tccaaacaca ttaaacaaag gcaatttaga 840 
aggcttcttc ttcttcctgg catcattaac attgttgaac gtcctgggat tctgcagtgt 900 
ttcacaaaga tattgtaatc taaatcattt taatgcccag aacatccgtg gaagtaatct 960 
tgaagaaaca cttctcctcc acgaaaaatc tctgaaattt tatggcagta tacaggaatt 1020 
ttcttcaagt attgatcttt gggagacagc cctatgaaac tgtgtttgag tctacctgtc 1080 
ttatgagaac agtattcatt gttttcttct tcagttgagc attttatgtg ttttagtgta 1140 
aaagccaata tatacagagg tgattttaca aatatcatct gtatacttta tttacaaaga 1200 
ctaattatga attttctagt aaactgtcat gcttttgcat taacatgtta tagcattaga 1260 
atttaataaa ataatttcaa agtctcagtg attactctat gacaaaaaaa aaaggggggc 1320 
gccgattagt gagcttcgtc gaccgggaaa taattccgga cggacctgca ggcgtaggcg 1380 
atcctgagat tccatcaatg ggggcgtcga gcatgtttaa aggc 1424 

<210> 53 

<211> 3598 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5508353CB1 

<400> 53 

caatttgcaa cacagtagtg 
cttcactggg ggggttgccc 
ggtctgtccg aagatcaagt 
ttccaaacgc ctttgtgagc 
tggaggaaga ggtctcccag 
gcacagaaac agagaaacaa 
tccctggaca gccattggcc 
ccttagtgta tgccgccagg 
tcatggtgga ctttgctgct 
ttgactcagt aagaaaaaga 
tggtgtatac gaaaggcgct 
atggagcaag tctggagaaa 
atgactatgc caaacaaggc 
ctgaatatgc agagtggctg 
aagaattact acttgaatct 
ctggcattga agaccgtctg 
cgggcatcaa gatctggatg 
atgcatgcaa actactggag 
atgcctgtgg gatgctgatg 
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gtttctgctc ctaaccaacc 
attaagtctt tggaagagat 
tctccatcgc ttaacagtgg 
agactccctc tctttagtcg 
gtgtgtgaga gcccccagtg 
cacggtgatg caggcctcct 
tgcaacctgt gttatgaggc 
gcttaccaat gcactttacg 
ttgggaccat taacatttca 
atgtctgttg tggtccgaca 
gattctgtga tcatggagtt 
caacagatga tagtaaggga 
cttcgtactt tatgtatagc 
aggaatcatt ttttagctga 
gccatgaggt tggagaacaa 
caggagggag tccctgaatc 
ctgacagggg acaagcagga 
ccagatgaca agctttttat 
agcacaattt tgaaagaact 



ccgacaaaag atcagacacc 60 
taaaagtctt ttccagagat 120 
gaaagagcca tcttctggag 180 
aatgaaacca gcttcacctg 240 
ctccagtagc tcagcttgct 300 
gaatggcaag gcagagtccc 3 60 
cgagagccca gacgaagcgg 420 
gtctcggaca ccagagcagg 480 
actcctacac atcctgccct 540 
ccctctttcc aatcaagttg 600 
actgtcggtg gcttccccag 660 
gaaaacccag aagcacttgg 720 
aaagaaggtc atgagtgaca 780 
aaccagcatt gacaacaggg 84 0 
acttacatta cttggtgcta 900 
tatagaagct cttcacaaag 960 
gacagctgtc aacatagctt 1020 
cctcaatacc caaagtaaag 1080 
tcagaagaaa actcaagccc 1140 



WSDOCtD: <WO 024641 5A2 J A> 



tgccagagca agtgtcatta agtgaagatt tacttcagcc tcctgtcccc cgggactcag 1200 

ggttacgagc tggactcatt atcactggga agaccctgga gtttgccctg caagaaagtc 1260 

tgcaaaagca gttcctggaa ctgacatctt ggggtcaagc tgtggtctgc tgccgagcca 1320 

caccgctgca gaaaagtgaa gtggtgaaat tggtccgcag ccatctccag gtgatgaccc 1380 

ttgctattgg tgatggtgcc aatgatgtta gcatgataca agtggcagac attgggatag 1440 

gggtctcagg tcaagaaggc atgcaggctg tgatggccag tgactttgcc gtttctcagt 1500 

tcaaacatct cagcaagctc cttcttgtcc atggacactg gtgttataca cggctttcca 1560 

acatgattct ctattttttc tataagaatg tggcctatgt gaacctcctt ttctggtacc 1620 

agttcttttg tggattttca ggaacatcca tgactgatta ctgggttttg atcttcttca 1680 

acctcctctt cacatctgcc cctcctgtca tttatggtgt tttggagaaa gatgtgtctg 1740 

cagagaccct catgcaactg cctgaacttt acaaaagtgg tcagaaatca gaggcatact 1800 

taccccatac cttctggatc accttattgg atgcttttta tcaaagcctg gtctgcttct 1860 

ttgtgcctta ttttacctac cagggctcag atactgacat ctttgcattt ggaaaccccc 1920 

tgaacacagc cgctctgttc atcgttctcc tccatctggt cattgaaagc aagagtttga 1980 

cttggattca cttgctggtc atcattggta gcatcttgtc ttatttttta tttgccatag 2040 

tttttggagc catgtgtgta acttgcaacc caccatccaa cccttactgg attatgcagg 2100 

agcacatgct ggatccagta ttctacttag tttgtatcct cacgacgtcc attgctcttc 2160 

tgcccaggtt tgtatacaga gttcttcagg gatccctgtt tccatctcca attctgagag 2220 

ctaagcactt tgacagacta actccagagg agaggactaa agctctcaag aagtggagag 2280 

gggctggaaa gatgaatcaa gtgacatcaa agtatgctaa ccaatcagct ggcaagtcag 2340 

gaagaagacc catgcctggc ccttctgctg tatttgcaat gaagtcagca agttcctgtg 2400 

ctattgagca aggaaactta tctctgtgtg aaactgcttt agatcaaggc tactctgaaa 2460 

ctaaggcctt tgagatggct ggaccctcca aaggtaaaga aagctagata ccctccttgg 2520 

agttgcaagt attctttcaa ggttggaaga gggattttga agaggtatct ctccaagcaa 2580 

gaatgacttg tttttccata agggacatga gcattttact aggcttggaa gagctgacat 2640 

gatgagcatt attgtatgtt tgtatataca tttgtgatag agggctagag tttgacctag 2700 

agagagttta aggaagtgaa atatttaatt cagaaccaaa tgcttttgta aaactttttg 2760 

gattttgtaa aagcattttc attctcttag aaattcaagt attttcaagg ggagtcattt 2820 

gagatatatt tattttacta ggagatctta tattctaggg aaatgcttta aatggtcagg 2880 

ctccagtcgg aattttttta agaaaaaagt agtttttaat acattggtta ggactcagag 2940 

gaaatacgga aaaaacattg tagatggtta atttacagat aaaatcccaa gagcctttta 3000 

aacaacaagg tacctaaaat agggtataat tatactgctt aaaatacagt tagtgcctat 3060 

taatagcttt ttatttccta tggggagatg cttttggtct tctggctgaa gatgtaggca 3120 

tacctctcac tcatttcaat gttttcctga ggtggagcct tcattggaaa ggggaaagag 3180 

ggattctagg gtttcatcag ggaccaggaa tgcattcctc tgtcaggttc caatcaagag 3240 

aagacctttt atgagatctg cctctgtata gatgtcgtca gttaggaaac tgaagccata 3300 

ggtcaggaga catcagctca gcctgtggcc atgggtgatt tcctgtattt taaaactgac 3360 

agtagcctga tccaacgtga tatcaatcat tttcaacacc atccttcaga ggacaatgtg 3420 

aggtcattaa gttttacaat tacctggaga ttttcaggtg gtggggcctc taaaattgga 3480 

ttaacacgga actatacagt ggtccggagg tcaaggggtc tcgaaaattt ggtacatctc 3540 

tcggcgggag tcacacagtc cctcggactt gtgcgggtgg cacacattta caggggca 3598 

<210> 54 

<211> 1485 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 8543628CB1 



<400> 54 

gctattggat tctccctata tgagtcgtat 

gacgctgatt tgcgtgctgt tgttcttgcc 

gaggctccag catggccagt tcgggtcttg 

gcaagggcgc tgcgcaaacc ctgtcaattt 

gcggcgtgct ctacggcgtg ctgcgcactt 

gggtctacct ggaactgttc cgcgccattc 

ttggcgtgcc gatcttcttc ggcctgagca 

tgtcgttgtg gggcgccagc gaggtgggcg 

cccgtggcca acgcgaagca ggcttgtcga 

acgtactgct gccccaggcc ctcaggcgca 

gcaccatcaa gaccagctcc ctggcggtgc 



taatttctac tacgaaatgc tcgccgtgct 60 
gttgtcgctg ttgctcagcc gcctggaaag 120 
agttgctctg ggtgtcgttg ccgcaactgg 180 
cgtttttgag catcggcttc agtacggttg 240 
tgaataacaa ggcgatcaac ggcgtgttgc 300 
cggtgctggt gtggttgtac ctgctgtttt 360 
tcccgagttt ctggtgcgcg gtgctggtgc 420 
aagtggtgcg tggcgcgctg cattcgctgc 480 
ttggcctgtc cgacctacag ttgtacggct 540 
tgacgccgcc gaccatcaac gtgtacacgc 600 
tgatcggcgt ggtggatgtg atcaaggtcg 660 
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gccagcaaat 
ttttcttctt 
cccaagcatg 
ggtgctcaag 
cagcggctgt 
cggcagcctg 
acgccaggac 
cgaaacatcc 
aggcggaaaa 
agctttccgg 
tcatgctgtt 
agaattgccg 
atgaggggcg 
gaggtaatta 



catcgagcgc 
tatctgctac 
agcgcattga 
ggcatcgacc 
ggcaaaagca 
cgctttgccg 
attggcatgg 
tgctcggccg 
atgctgggag 
tggccagcaa 
gacgaagttc 
gaggggtacc 
tctcaaaaac 
cgctgcctcg 



acctacgagt 
ccgttgtcgg 
tcgagttcca 
tgagcgtgca 
ccttgttgcg 
gcaaagagct 
tattccagag 
ctgaaagtgg 
cgccgtggcc 
caacgcatcg 
accggggcct 
ctgtgtttcc 
ccggttttta 
gggagaacaa 



<210> 55 

<211> 1470 

<212> DMA 

<213> Homo sapiens 



cggtattgat 
ccgcctccaa 
gggtttcaac 
gcgcggcgaa 
ctgcctcaat 
gctggataaa 
ctaccacctg 
aaaagcgcga 
ttgccgaaag 
catcgtccgt 
cgccgaaatg 
cagaatggtt 
ccgcacgcgc 
cggagacaca 



ctacggtgcg 
gctgctggaa 
aaattcttcg 
gtggtggtga 
gggctggaag 
accaccgact 
tcccgcacat 
ccgcgtggtt 
cgcgacgcct 
cgttgtgcat 
gcaggaatgt 
gccggcggcg 
gactggagct 
cgcgc 



ctgttcctgt 
cggcgctggg 
gcgaggcgca 
tcttaggccc 
tcgcgcacag 
ggcgccaggt 
gagcgtgcct 
gcctgtgagc 
ttcccacgcc 
gaaccctcag 
tggagtgtcc 
cgacagggtg 
tttcccccaa 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1485 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7482754CB1 



<400> 55 

gcctgttctg cagaggagga ggctgggctc agggcacttt tcacgccccc tccgttgggc 60 

tggggacctt ggtaagccat ggagggacag accccaggaa gcaggggcct tccagaaaag 120 

cctcaccctg ccacggctgc tgccactctg tcctcgatgg gcgctgtctt catcctcatg 180 

aagtccgcgc tgggagctgg cctgctcaac ttcccctggg ccttctccaa agcgggcgga 240 

gtggtccctg ccttcctggt ggagctggtc tcgttggtct tcctgatcag cgggctggtc 300 

atcctgggct atgctgctgc tgtcagtggc caggccacct accagggtgt ggtcaggggg 360 

ctgtgtggcc ctgccattgg gaagctgtgt gaggcctgct tcctcctcaa cctgctcatg 420 

atctccgtgg ccttcctcag ggtgatcggg gaccagctgg agaagctgtg tgactccctc 480 

ctgtctggca ccccgcccgc cccgcagccg tggtacgcag accagcgctt caccctgccc 540 

ctgctctccg tgctggtcat cctgcccctg tctgccccgc gggagatcgc cttccagaaa 600 

tacacaagcc cctcacacgg tcactgtgtt agcatcctag gcactctggc tgcctgttac 660 

ctggccctgg tcatcaccgt gcagtactac ctctggcccc agggcctcgt gcgtgagtcc 720 

catccttcac tgagccctgc ctcctggacc tctgtgttca gtgtcttccc caccatctgc 780 

ttcgggtttc agtgtcacga agctgccgtc tccatctact gcagcatgcg caaacggagc 840 

ctctcccact gggccctggt gtctgtgctg tccttgctgg cctgctgcct catctattca 900 

ctgacggggg tttatggctt cctgactttt gggacagaag tttctgctga cgtcttgatg 960 

tcctacccag gcaatgatat ggtcatcatt gtggcccggg tcctttttgc tgtctccatc 1020 

gtaactgtct accccatcgt gctcttcctg gggaggtcag tgatgcagga cttctggagg 1080 

aggagctgct tggggggatg ggggcccagc gccctggccg acccctcagg gctgtgggtc 1140 

cggatgccgc tgaccatcct gtgggtcacc gtgacgctcg ccatggcgct gtttatgcct 1200 

gacctcagcg agatcgtcag catcatcgga ggcatcagtt ccttcttcat cttcatcttc 1260 

ccaggccctg gaggcaaaca ccgtttctgc gaaggatggt ttcttaggaa accttcccaa 1320 

acagcccaag cattatctga gaaggggaag ccagcaaggt ttgtgcctca tctgtgcaat 1380 

gggtgtcgag cctataggac caagagtcaa gtgagaaacc tgcgagccac aggaagagag 1440 

tctctgtcta cccagtatga gggcatctga 1470 



<210> 56 

<211> 3132 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 3794818CB1 



<400> 56 

atggagtttg tgcgggcgct gtggctgggc ctggcgctgg cgctggggcc ggggtccgcg 60 
gggggccacc ctcagccgtg cggcgtcctg gcgcgcctcg ggggctccgt gcgcctgggc 120 
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* gccctcctgc cccgcgcgcc tctcgcccgc 

gccctggcgc cgcggctgcc gcacaacctg 
gcccgcgacc ccgcctcgct gacccgcggc 
gcggccctgc tcgcctttcc cgaggctcgg 
gcggccaccg agacccccgt gctcagcctg 
gccccgaacc cattccacct gcagctgcac 
gtgctggtgg cggtgctgca ggcgcacgcc 
actcaggacc ccggcggcct ggtggccctc 
ctggtcctgg acctaagccg gcgggacacg 
ccgatggcgg cgccagtggg gggtgaagca 
gacatcgccc gtgcccgtcg ggtgctggag 
gggacaccac tgccgcccaa ggccctgccc 
ctgggcgagg tggcacgacc cccgctggag 
gcccgggcgc tgggcagtgc ggcccaggtg 
gtcaactgcg gggacctgca gccggccggg 
ttcctggcca acacgtcctt ccagggccgc 
caggtacaca tgtctcggca ctttaaggtg 
ccggcctggg ccacggtggg cagctggcgg 
gcctctgcac ggcccccgcc cccacagggt 
acgctgttgg aacacccatt tgtgtttgcc 
gcggggcagc tgtgcctgga ccctggcacc 
gccgcgctgg ccaacggctc agcgccccgt 
attgacctgc tggagcggct ggcggaggac 
ggtgacggca agtacggcgc cctgcgggac 
ctggccggcc gggcccacat ggcggtcacc 
gtggtggact tcaccagccc cttcttctcc 
gacacggcct cacccatcgg tgcctttatg 
gtctttgcgg ccctgcacct caccgcgctc 
tacggcctca cgccacgtgg ccgcaaccgc 
aacctgtgct acgccatcct cttcagacgc 
acgggccgcc tgctcatgaa cctctgggcc 
acggccaacc tggctgccgt catggtcggg 
cacgacccca agctgcacca cccggcgcag 
agcgccgagg cgtacatcaa gaagagcttc 
agcgcgccca ccacgccccg cggcgtcgcc 
gccttcatca tggacaagtc gctcctggac 
ctgctgaccg tgggaaagcc cttcgccatt 
tcgccgctca cctccaacct gtccgagttc 
gacctgctcc acgacaagtg gtacaagatg 
acagagaccc tgcagatgag catctaccac 
ggcctgggca gcgctctgct cagctcgctg 
ccgcgcatcc gcaaggggag caggctgcag 
cgcgccctca acacggagcc accagagggg 
agcggccccg aggtggagca gcagcagcag 
tggaaacggg cgcgccgggc cgtggacaag 
gccgtggttg tggcacccga agcggacgcg 
gtctggctgt gctcctacgg ccgcccgccc 
ggggagctgc aggagctgga gcgccgcatc 
ctggtgcggc gcggccagct cctggcacag 
cgcttgcttc aggccagagc ggcccccgcg 
agccaggaat ga 



gcccgcgccc gcgccgccct ggcccgggcc 180 

agcttggagc tggtggtcgc cgcgcccccc 240 

ctgtgccagg cgctggtgcc tccgggcgtg 300 

cccgagctgc tgcagctgca cttcctggcg 360 

ctgcggcggg aggcgcgcgc gcccctcgga 420 

tgggccagcc ccctggagac gctgctggat 480 

tgggaagacg tcggcctggc cctgtgccgc 540 

tggacaagcc gggctggccg gcccccacag 600 

ggagatgcag gactgcgggc acgcctggcc 660 

ccggtacccg cggcggtcct cctcggctgt 720 

gccgtacctc ccggccccca ctggctgttg 780 

accgcggggc tgccaccagg gctgctggcg 840 

gccgccatcc atgacattgt gcaactggtg 900 

cagccgaagc gagccctcct ccccgccccg 960 

cccgagtccc cggggcgctt cttggcacgg 1020 

acgggccccg tgtgggtgac aggcagctcc 1080 

tggagccttc gccgggaccc acggggcgcc 1140 

gacggccagc tggacttgga accgggaggt 1200 

gcccaggtct ggcccaagct gcgtgtggta 1260 

cgtgatccag acgaagacgg gcagtgccca 1320 

aacgactcgg ccaccctgga cgcactgttc 1380 

gccctgcgca agtgctgcta cggctactgc 1440 

acgcccttcg acttcgagct gtacctcgtg 1500 

ggccgctgga ccggcctggt cggggacctg 1560 

agcttcagta tcaactccgc ccgctcacag 1620 

accagcctgg gcatcatggt gcgggcacgg 1680 

tggcccctgc actggtccac gtggctgggc 1740 

ttcctcaccg tgtacgagtg gcgtagcccc 1800 

agcaccgtct tctcctactc ctcagccctc 1860 

accgtgtcca gcaagacgcc caagtgcccc 1920 

atcttctgcc tgctggtgct gtccagctac 1980 

gacaagacct tcgaggagct gtcggggatc 2040 

ggcttccgct tcggcaccgt gtgggagagc 2100 

cccgacatgc acgcacacat gcggcgccac 2160 

atgctcacga gcgacccccc caagctcaac 2220 

tacgaggtct ccatcgacgc cgactgcaaa 2280 

gagggctatg ggatcggact gccccagaac 2340 

atcagccgct acaagtcctc cggcttcatc 2400 

gtgccttgcg gcaagcgggt ctttgcggtt 2460 

ttcgcgggcc tcttcgtgtt gctgtgcctg 2520 

ggcgagcacg ccttcttccg cctggcgctg 2580 

tactggctgc acaccagcca gaaaatccac 2640 

tcgaaggagg agacggcaga ggcggagccc 2700 

cagcaggacc agccaacggc tccggagggc 27 60 

gagcgccgcg tgcgcttcct gctggagccc 2820 

gaggcggagg ctgcgccgcg agagggcccc 2880 

gccgcaaggc ccacgggggc cccccagccc 2940 

gaagtcgcgc gtgagcggct ccgccaggcc 3000 

ctcggggaca gcgcacgtca ccggcctcgg 3060 

gaggccccac cacactctgg ccgaccgggg 3120 

3132 



<210> 57 
<211> 1832 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4717525CB1 

<400> 57 

ggagctgacc ctgcggggtc ccgggggggg agggggagcc gcgaagcccc cactgagtgc 60 
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cgccgctgcc ggggggggag ggggagccgc gaagccccca ctgaggccgc cgctgccggg 120 i 

cctcccctcc cccccgggcg ggcgccatgc gggggagccc gggcgacgcg gagcggcggc 180 

agcgctgggg tcgcctgttc gaggagctgg acagtaacaa ggatggccgc gtggacgtgc 240 

acgagttgcg ccaggggctg gccaggctgg gcgggggcaa cccagacccc ggcgcccaae 300 

agggtatctc ctctgagggt gatgctgacc cagatggcgg gctcgacctg gaggaatttt 360 

cccgctatct gcaggagcgg gaacagcgtc tgctgctcat gtttcacagt cttgaccgga 420 

accaggatgg tcacattgat gtctctgaga tccaacagag tttccgagct ctgggcattt 480 

ccatctcgct ggagcaggct gagaaaattt tgcacagcat ggaccgagac ggcacaatga 540 

ccattgactg gcaagaatgg cgcgaccact tcctgttgca ttcgctggaa aatgtggagg 600 

acgtgctgta tttctggaag cattccacgg tcctggacat tggcgagtgc ctgacagtgc 660 

cggacgagtt ctcaaagcaa gagaagctga cgggcatgtg gtggaagcag ctggtggccg 720 

gcgcagtggc aggtgccgtg tcacggacag gcacggcccc tctggaccgc ctcaaggtct 780 

tcatgcaggt ccatgcctca aagaccaacc ggctgaacat ccttgggggg cttcgaagca 840 

tggtccttga gggaggcatc cgctccctgt ggcgcggcaa tggtattaat gtactcaaga 900 

ttgcccccga gtcagctatc aagttcatgg cctatgaaca gatcaagagg gccatcctgg 9 60 

ggcagcagga gacactgcat gtgcaggagc gcttcgtggc tggctccctg gctggtgcca 1020 

cagcccaaac catcatttac cctatggaga ctctgaagaa ctggtggctt cagcagtaca 1080 

gccacgactc ggcagaccca ggcatcctcg tgctcctggc ctgcggtacc atatccagca 1140 

cctgcggcca gatagccagt tacccgctgg ccctggtccg gacccgcatg caggcacaag 1200 

cctccatcga gggtggcccc cagctgtcca tgctgggtct gctacgtcac atcctgtccc 1260 

aggagggcat gcggggcctc taccggggga tcgcccccaa cttcatgaag gttattccag 1320 

ctgtgagcat ctcctatgtg gtctacgaga acatgaagca ggccttaggg gtcacgtcca 1380 

ggctcgagta cagtggctcg atttcagatc actgcaacct ctgccttcca ggttcaagcg 1440 . 

attctcctgc ctcagcctcc agagtagctg gtattacagg atttcatcat gttgcccagg 1500 

ctcatctcgg actcgtgggt tcaaggaatt cggcagcctt cagccttcca acgtgctggg 1560 

attacaggaa gccggtcgtc atgccatgag cagccttatg gagtggacca tgtggtaagg 1620 

aactcagcca atagccatgt aactgagctt tgaagaggat cttgctgtcc tggccaacat 1680 

ctcactgcga ttctatcagt tgaatttcct ggatagtcca agctttgttg gatccctgca 1740 

ccggtacagc tgggttccag taactgaatc ctgagtctta tgatgtttta cttcaagcat 1800 

gggtcagggg acagcgggtt atgggttcct tg 1832 

<210> 58 
<211> 1902 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 5091793CB1 

<400> 58 

cccaccttca gtcctggctc 
agcaggtgtc gcgagagttg 
tgcggaacga aagtgaacag 
acattttaga gactgaagag 
ttactatgtt tctcagcagt 
tccaaaagat tgatccgaca 
gtcttggcca aatggtagct 
aagagcctct tattgtctcc 
tccacatccc agcttctcat 
ttggagcagg aaatgtagca 
aaagaacaag ttccatggca 
cagtttttca gacttgtttt 
aactgcagat aaacatgtat 
atattattct gatccttgcc 
gtaaaagtat taattttgaa 
ttgaccaggt tgctgttgtg 
ccctttttga aaccatcatt 
aagctgtgtt atataatggc 
tcttaggagt taagttgctt 
tcatcgttgt atgggttggc 
tacagtggga agatttgcac 
ttggtctttg gaagtctcca 
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tgacaagccc tccagcttca 
ggcgcaagac gccttgtagg 
gagccgctct taggcgacac 
cattataaga gccgatggag 
gtagggtttt ctgtagtgat 
gctgatacaa gttttttggg 
tcacctatat ttggtttatg 
atcttgattt ccgtggcagc 
aataaatact acatgctggt 
gttgttagat catatactgc 
aacataagca tgtgtcaagc 
acattccttg gagaaaaagg 
acaacaccag ttttacttag 
atactaagag aacatcgtgt 
gaagcaagta cagatgaagc 
gccatcaatg ttctgttttt 
actccattaa caatggatat 
ataatacttg ctgctcttgg 
tccaaaaaga ttggcgagcg 
ttctttatct tgttaccttg 
aataattcaa tccctaatac 
atggaagatg acaatgaaag 



cgccacccgg gatgggagaa 60 
gagtgtaact atggccggcc 120 
acctggaagc agagaatggg 180 
atctattagg attttatatc 240 
gatgtccata tggccatatc 3 00 
ctgggttatt gcttcatata 360 
gtctaattat agaccaagaa 420 
caactgcctc tatgcatatc 480 
tgctcgtgga ttgttgggaa 540 
tggtgctact tcccttcagg 600 
attaggtttt attctaggtc 660 
tgtgacatgg gatgtgatta 720 
cgccttcctg ggaattttaa 780 
ggatgactca ggaagacagt 840 
tcaggttccc caaggaaata 900 
tgtgactcta tttatctttg 960 
gtatgcctgg actcaagaac 1020 
ggttgaagcc gttgttattt 1080 
tgctattcta ctgggaggac 1140 
gggaaatcaa tttcccaaaa 1200 
cacatttggg gaaattatta 1260 
accaactggt tgctcgattg 1320 



3NSDOCID: <WO 024641 5A2JA> 



I 



Is.- 



1 aacaagcctg gtgcctctac accccggtga 

tgctaatagg attaggctat ccagtctgca 
ttctaggacc aaaacctcag ggtgtataca 
cccggattct tgggcctatg ttcatcagcc 
cattcagcct ggtgtgtgga ataatagtgc 
aaagactcat tgctctttct gtaagatatg 
gtgatggaaa ctacttgctg tgtggcactt 
gttgggtgca gtgtctcatg cctgtaatcc 
tgcttgaggc caggaattca agaccagcct 
aaaataaaaa caaaaattta aaactgtaaa 

<210> 59 
<211> 2820 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 5945527CB1 



ttcatctggc ccagttcctt acatcagctg 1380 

atcttatgtc ctatactcta tattcaaaaa 1440 

tgggctggtt aacagcatct ggaagtggag 1500 

aagtgtatgc tcactgggga ccacgatggg 1560 

tcaccatcac cctcctggga gtggtttaca 1620 

ggaggattca ggaataaact agctaagact 1680 

cctggtctaa agctctgcta gacaattgcg 1740 

cagcactttg gaaggccaag gcaggaggat 1800 

gggaaacaaa gtgagaccct gtctttacaa 1860 

aaaaaaaaaa aa 1902 



<400> 59 

gcgcccagct 

ggtcctccct 

gaggcctcat 

ctatcagtat 

atgatactca 

actataagga 

tgttcatcag 

tgctgctcag 

agctctggta 

cctctgtggt 

gcatctggaa 

gggtgaacgg 

gcgtcatcac 

agcaccacgg 

gctctatcag 

agcctgctgc 

tgtgtctgct 

tcgccaatgt 

ttggtggcat 

ccaccacttg 

ttggccagga 

atggcccata 

tgaagggcaa 

taggtgcggc 

tcttctacat 

acaaagagat 

gaggccccaa 

tgggcaagac 

gccagcccag 

actgccaagc 

agggatggga 

tcttgcttgt 

atctctccct 

tctcactcca 

tgtgcatttc 

ttacccaccc 

cttcggtaca 

ccttgcctca 

tgctgtgggg 

gtattttgag 

gagtagacag 

actggcaaga 



ctgtagcctc 
ggctccggga 
cctgctgctg 
cgtcaagagc 
cagtctcaat 
gttactaggg 
tggggtttct 
tggccttttc 
ctttgtggtc 
gacctgtgtt 
ttcccacaca 
gcagtggggc 
cttcctcctc 
tgagccagct 
ggagagcggc 
catcagcttc 
gtttgccaag 
ggctcacttt 
cataggcggc 
ctgtgtcatg 
cgggattgcc 
cgcgctcatc 
cgccaaagcc 
tctggggcct 
gctcatctct 
cttggcctgg 
ttggaacagc 
aatggaaact 
cccagacccc 
atgaggaaat 
ctagggctga 
tcagattcca 
cccatgttaa 
tcaaaggagc 
cccaagctgg 
atcactgctg 
taatttgaag 
ctctgatccc 
agagtagaaa 
aatgaaatgc 
tggacacaag 
cgtgagggca 



ctccgtcgac 
gtctggttct 
acct tcctaa 
cgtctgcacc 
gacaccatgt 
ggcgtggaca 
ggggagcggc 

acctcgctct 
atccaggtct 
ggcaactggt 
tctgtgggca 
ctgtcgttca 
ctcatcgaac 
gagaaccagg 
cttgagactg 
tttggggcgc 
ctggtcagtt 
agtgccaagg 
atcgtggcag 
ctcatcttgg 
agctccatag 
accaccgctg 
ctgtccacgg 
ctgctggctg 
gccgacgtcc 
aaggtgtccc 
agcatggagg 
tccacaagca 
agggctgcct 
agaagattca 
gttgtgtctc 
agacagaagg 
gctttaacct 
ccagcctctc 
gccctcttct 
ttcctccagg 
gggcactccc 
ggccccataa 
gaggggttgg 
caaggagtgc 
aaggacttac 
tgaatggttc 



tcagccttag 
tccgggcctt 
tttacgcctg 
agaactgctc 
ggtgcagctg 
acgccttcct 
ttccgctccg 
ttggcctggg 
gtaatggact 
tcgggaaggg 
acatcctggg 
tcgtgcctgg 
acccagaaga 
acaaccctga 
tggccaaatg 
tccggatccc 
acaccttcct 
aggctgggga 
ggctcgtctc 
ctgcccccat 
tgatgctgat 
tctctgctga 
tcacggccat 
ggctcatctc 
tagcctgctt 
tgagcagagg 
gtcccagttg 
ggggaggcaa 
aaggacacag 
ggggcctgag 
cattttgata 
cttcacaagg 
ctgtaatctg 
tttgtccctc 
actctccatt 
gccagcactc 
tcttgggcac 
tgtcctcagt 
catgactaaa 
ctactgtatg 
gccctgagca 
attcaggcag 



gtaccggtca 
ctccagggac 
ctatcacatg 
ggagcagatc 
ggccccattt 
catcgcctat 
ttactacctc 
atatttctgg 
cgtccagacc 
gaagcggggg 
ctccctgatc 
catcattact 
tgtggactgc 
ggaccctggg 
ctccaagggg 
aggcgtggtc 
ctactggctg 
cctgtctaca 
tgactacacc 
gatgttcctg 
catctgtggg 
tctggggact 
cattgacggc 
ccccacgggc 
gctcctttgc 
cagcgggtat 
ggtccccaac 
accctcttta 
agattctcca 
ctctggaagc 
aggaaaggat 
ccaacgcctg 
cctgtatcta 
taccatgcaa 
taggcctgtt 
gggcgaggca 
atgccggccc 
ggaaggtgac 
aataccagta 
ccagctctgc 
caggtgccaa 
ctgctgcagg 



ggcaaaatgc 
agctggttcc 
tccaggaagc 
aaacccatca 
gacaaggaca 
gccatcggca 
tcagctggaa 
aacatccacg 
acaggctggc 
ttcatcatgg 
gccggcatct 
gccgtcatgg 
gcccctcctc 
aacagtccct 
ccatgcgaag 
gagttctctc 
cccctctaca 
ctcttcgatg 
aatggcaggg 
tacaactaca 
ggcctggtca 
cacaagagcc 
accggctcca 
tggaacaatg 
cggttagtat 
aaagaaatat 
gtgctcccca 
ttgaacatta 
tgggaagggg 
tgcaagcaaa 
atgctcacac 
gaaaatgggc 
taggtgggca 
cagftcttctc 
gataactcca 
ggggagctgc 
tgagtgcctc 
tgggggccgg 
tgtgtattaa 
tctaggaatg 
cggtgacaag 
tgtggtcacc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 
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tggtgccatc tgctgctccc ttttccactt 
gatcactcgc tgttttctgg ctagctcttg 
tggcaccaat agattctgtt tgacctgctg 
ttggcagcga gtgaggctgg catggggtgc 
gtgctggcag tcctcacagc cctcgctcgc 



ttctatgtcc tccttccacc ccaagtcccg 2580 
gcatctccat ctgagcctaa agttgcccac 2640 
tgcccatgct catctttgtc ggggcttggc 2700 
tgaagtgggg tctcagtgag gggtgacagc 2760 
tctcggcgcc tcctctgcct gggctcccac 2820 



<210> 60 

<211> 3920 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 6941124CB1 



<400> 60 

ccggccgtga gtgcgcgagt gtgccatggc 
gccgtgcggt gggtgcctga acccgggggc 
ccgtctcccg tcggggaccc cggggagaac 
ccggtgggcg agagggctgc gggattccca 
gcccgggcgc ggcttcgcgg ccaccgcgcg 
tcttgaattt cttaccaggt ttgaacagct 
aagtgggcag ggcggagctg tggggctccc 
tggcgggccg ccgcctcgga aggtctcatt 
gcgccgcggt gggagccccg gaacccctgg 
gccctgcgcg gccgcagcgg tttctccctt 
cccgccagga cccaggtttg tacgaccctt 
ggagtgtgca ccggcccaac cctcccagac 
agcgccccct agcagatggg ctaagcaatt 
gctatggggc cccctcccga gcactgtccc 
gttgcatccc tcggaagagc gaaggaatga 
cacaggaggt ccagggcttg gtccatgggg 
actctcccac catgtcgcag aatggatact 
cagatgacac cttcgaagcc cgagaggaga 
actcctccat cgtgtctggc gagagtatcc 
ggcccacaac actgagaatg aagcgacttc 
aaagtacctg aagttaaaga acttcaagga 
cttcctggcg caggccagca tcgtcctgaa 
gcggaccatg cttcgccgct tcgcccggga 
ggacctgctc -atggccatgc tcttcaccga 
cctgctgtca gataccatcc aaggggtcac 
gtcgtggctc tgcatcatct gtaccatgaa 
ccgcctggtt cgcccacaga actgggggga 
ggtgctggcc ccacccaaga tgaaaagcac 
tgccaccatg ttctcggata tcgccttccg 
ggaattcaag gaggccttgg tgcatcagag 
agtggcgccg agaacgaagg aacgcagcac 
gcccccaaag tgcaaggact ttgtcccttt 
caggttcccc ttgtacccct tggacttcac 
gggcaaatac atcaccacca ccctgttcct 
tttcgggtct ctcaatgacg agaacacaga 
cgggcagagc atcgggggcc tgctctacgc 
gctgaccacc gcgcccctgg cgctctacat 
tgacctggac ttcaactcct tctacgcatg 
gctttatgcc tttttcaacc tcagcctggt 
gatcatcgcc ctcttcattt ccatcacgtt 
aatcttctgg aagtactact atgggcatta 
atcccttgtc agcctgtcag gcctcggcgc 
caatgccagc ttcctcgcca gccccacgga 
gaccgccgtg ctcagcctcc tcatcatgct 
ccaattcaag aagagcccct acctgcaccc 
cctgcccatc gcggtgctcg ccttctccct 
gatgagcaag ttccgctaca accccagcga 



cgcggccacc aggcgcgtgt tccatctgca 60 
gcggcgggaa ggccgcccgg gtctccgcca 120 
gcggatgccg tctgcctgcg cgggccgggg 180 
gccgcgggga ggagggcgcg ggtgcgggag 240 
cttggtttca ttagggctct gctgtcgccg 300 
gatgcagcgg gtcgagggcg gggagctcga 360 
agcgccccga gcgccgagct tttgggagcc 420 
ttaccttccg cccgggcatc ttcccgtccg 480 
agttttggga gtcccgggac . ccctgaccgt 540 
gctctgacac gggagggcca gccctgggtc 600 
tacctgcacg tgggccgccg ggacggaagc 660 
cctcgtcgcc cacggcctgg gacgcgcccc 720 
gaatttctga gattaaggct ggcttcccct 780 
ctgcttccct ttctccctag ggtggcgtgg 840 
gccaggtcgg ggggcgggga gacaggtgca 900 
ctggtgacct ttctgcttcc cttgcagaaa 960 
tcgaggattc aagctactac aagtgtgaca 1020 
tcctggggga tgaggccttc gacactgcca 1080 
gtttcttgtc aatgtcaacc ttgagatgca 1140 
cggtggctgt gtgctcctgc acacctcccg 1200 
agagatccgt gcgcaccgcg acctagatgg 1260 
cgagacggcc acctccctgg ataacgtgct 1320 
ccctgacaac aatgagccca actgcaacct 1380 
tgccggggca cccatgcggg gtaaagtcca 1440 
cgccacagtg acaggggtgc ggtaccagca 1500 
ggccctacag aagcggcacg tgtgcatcag 1560 
gaattcctgt gaggttcggt tcgtcatcct 1620 
taagactgcg atggaggtgg cgcgcacgtt 1680 
ccagaagctc ctggagaccc gcacagagga 1740 
acagctgctc accatggtga gccacggtcc 1800 
agtctccctc cctgcccaca gacacccaga 1860 
tgggaagggc atccgggagg acatcgcacg 1920 
tgatggcatt attgggaaaa acaaggctgt 1980 
ctacttcgcc tgcctcctgc ccaccatcgc 2040 
cggggccatc gacgtgcaga agaccatagc 2100 
gctcttctct gggcagccat tggtgattct 2160 
ccaggtgatt cgtgtcatct gtgatgacta 2220 
gacgggcctg tggaatagtt tcttccttgc 2280 
catgagtctc ttcaagaggt cgacggagga 2340 
tgtgctggat gccgtcaagg gcacggttaa 2400 
cttggacgac tatcacacaa aaaggacttc 2460 
cagcctcaac gccagcctcc acactgccct 2520 
gctgccctcg gccacacact caggccaggc 2580 
gggcacgctc tggctgggct acaccctcta 2640 
ctgcgtgcga gagatcctgt ccgactgcgc 2700 
catcagctcc catggcttcc gggaaatcga 2760 
gagccccttt gcgatggcgc agatccagtc 2820 
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gctgtccctg agggccgtca gcggtgccat gggcctcggc ttcctgctgt ccatgctctt 2880 

cttcatcgag cagaacttgg tggccgcctt ggtgaatgca ccggagaaca ggctggtgaa 2940 

gggcactgcc taccactggg acctcctgct cctcgccatc atcaacacag ggctgtctct 3000 

gtttgggctg ccttggatcc atgccgccta cccccactcc ccgctgcacg tgcgagccct 3060 

ggccttagtg gaggagcgtg tggagaacgg acacatctat gacacgattg tgaacgtgaa 3120 

ggagacgcgg ctgacctcgc tgggcgccag cgtcctggtg ggcctgtccc tgttgctgct 3180 

gccggtcccg cttcagtgga tccccaagcc cgtgctctat ggcctcttcc tctacatcgc 3240 

gctcacctcc ctcgatggca accagctcgt ccagcgcgtg gccctgctgc tcaaggagca 3300 

gactgcgtac cccccgacac actacatccg gagggtgccc cagaggaaga tccactactt 3360 

cacgggcctg caggtgcttc agctgctgct gctgtgtgcc ttcggcatga gctccctgcc 3420 

ctacatgaag atgatctttc ccctcatcat gatcgccatg atccccatcc gctatatcct 3480 

gctgccccga atcattgaag ccaagtactt ggatgtcatg gacgctgagc acaggccttg 3540 

actggcagac cctgcccacg ccccattcgc cagccctcca cgtcctccca ggctggctct 3600 

ggagctgtga ggggaggtgt aggtgtgtgg gtgactgctc tgtgctgcgc cttctcatgg 3660 

ctgactcagg cctggggcat ctgggcattg taggggtgca gtggtatgtg cccacccctc 3720 

tcccattatc ctttagcttt aggccaagag cgttgctcag ggcagcttct gcccagggtg 3780 

ggtgggactg agcaggatgg attttctttt gataaaagag tcgatgcctg aaagagaaac 3840 

catttccttg attgtgtaag gaacttgctg gacgcacatt agagaataaa gctcctgttt 3900 

ctaggctcct aaaaaaaaaa 3920 

<210> 61 

<211> 1333 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 6972530CB1 

<400> 61 

tggggcaacc tgctaacagc cgctgttgtg gctttgctgt atggcggact ggtcgtgggc 60 
ttccttccgc ggcagggcgt ttccttcgaa gggcatctgt ttggtgcgct cgccggattt 120 
attgcagcca aggtacttct atccacgccg cgcagcgggt ttaatgccgg gtgaacgcac 180 
ttgagcatcc agacgccgag gcgatgctgt cgtcaggatt caagtaccca gggagccagg 240 
ggctaaatgt cgattttcct actgctacgc atgtacgcct cgtccctctt tcatcgcttc 300 
ggttgggcag gcctgatcgt tgccttgggt gtccacctga gcaccgctta cctgggcctg 360 
gtgttgctgg gcgaacaaca cctcaccgct gccgccacgt ttatctactt ctatctcacc 420 
accacactca ccgtcggcta tggcgacctc gcgccgcaga cgtctgctgg gcgaattttc 480 
gtggcggcct gggtcatgct cgggggcatt gcgctgctca cggcagccat cggcaaaacc 540 
accagcagtg ttatcgatgc atggagaaaa ggcatgaagg gcaaaggtga tttcaccggc 600 
aaggtcggcc ataccgtgtt gatcggctgg gagggcgcgt ccagcgagcg ggtgattgag 660 
ttgttgctgc aagacgaaac ctcgaacgac aacctgatcg tgatctgcga ttgcagcctt 720 
gaagaaaatc cgatgccggg caaagcagcg ttcatccgcg gcgaaagcct gtcttccacc 780 
gcactgttgc tgcgtgccgg tgtgcccggt gccgagcgtg tgctggtgcg aaccccgtct 840 
gatgacctga cgctggccac cgtgctggcg gtgaaccaac tgagtcccgt cgggcatgtg 900 
gtcgcccact tcaacgaaag tgaaatcgcc gcactcgcca gctcctacgc cccgcgcctg 960 
gagtgcacgt ccagcatggc aattgaaatg ctggtgcgcg cctctcagga tccgggctcg 1020 
tcggtggtca tcaatgaatt gctgtgcgtg gggcagggcg ccacccagta cctgatgaaa 1080 
ttgcccgaag cctttgaggc aacattcggc gagctgtaca cgcagatgaa ggagcgccac 1140 
aacgccaccc tcattggcta tcgcgccaaa ggtgttcagc agccgtcgat caacccgccc 1200 
agtgccacag aggtcaaagg cggcgaactc ttctacatcg cttccactcg ccttaaggaa 1260 
atctcccatg ggatggctta aacagttgat ggggcttgag gccccaaaag ccactaccga 1320 
ctcgagccga acc 1333 

<210> 62 
<211> 8487 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6991750CB1 
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<400> 62 

atgcacctat taataattta cattcaattt 
tgtgtggttg agagaaccag aggggctctg 
cagattgaaa ctgatggcat taattacctt 
attacattcc ttccttggca gagatcagag 
attcctgaac ttaatgagta tttccgtgtg 
aagctaggct caactcctac cagtggtgca 
atcaccatca aagctagtga tcatccatat 
cctcagccta aggacgcaat gaccctgcct 
gaggaagatg gagaaatcag gttattggtc 
actgcggaat ttagaacagt gtccttgaca 
gctggcacat tagaatttca accaggagaa 
gataattcta ttcctgaact ggaaaaatct 
ggagccagtc taggagtggc ttcccaaatt 
catggcgtat ttgaatttag ccctgagtca 
gggtatagca ctgttacatt aaatgttata 
ttgcattgga acatagactc tgatcctgat 
acatttgaga ttgggcagac gagcgccaat 
ccagaactgg ataaggcatt ccctgtgtca 
gctcatatta atgccacgtt aacagttttg 
ttttctgaga aaaacagacc tgttaaagtt 
ataataaggt tgaaaggcct catgggaaaa 
atggaaaaac caccttattt tccacctaat 
ataccagctt ctggatttgc tctttttgga 
tcaattttgg atgatgatga gccagaaagg 
tctactttag tagcgaaagt. acagagtcgt 
aaggtagaaa ctattgcgca actaattatc 
cagctctcag caccaattgc ccgagtggca 
actagaacag gaggagcatt tgcagatgtc 
gcaatagctg gtgaagatta tagtatagct 
accagtaaag ccgtgccaat atatgtcatt 
tttcttgtgc aactgatgaa tgaaacaaca 
gcagtcatta ttattgaggc ctctgatgac 
aaacttattg tagaggaacc tgagtttaac 
aattctggga cactcggcaa tgttactgtt 
gctactggcg acctgcgagt tgtctcaggt 
caaaccttgt tgttagaggt. cctggctgac 
gtgcaactaa ctgatgcctc tggtggaggt 
attattcctg ccaatgatga tccttatggt 
gttcaagagc ctctggaaag aagctcctgt 
cactttggtc ggctgttgtt gttctacagt 
atggaggaag gtcaagattt actgtcctac 
ccactttgga gaacttggat gaatgtctct 
actttgtgcc ttaaggaaca agcttgctca 
ccccagtgtt tctggatgac atcatggatc 
acctacagga aaaacatgac cagggtagca 
agtgactatg agcctgtgac aaggcaatgg 
aatctcacag tgtctattct tcctgatgat 
tctctccttg aagttcacct catgaacatt 
ggacagccaa atatttctac agttgtcata 
gtgatctaca atattagtcc caatacttcc 
cagccccaaa ccttggtgga gctgatgata 
gcagtcgaat ggcgtgttgt tggtggaaca 
ggagagattc tgacctttgc tgaaggtgaa 
gatgactctg aaccagagga tgacgaaagt 
ggaagtagaa ttttgccaag ctccgacact 
gtggcaggaa ttgttagctt tcagacagct 
attttacaat tccatgtgat aagaactttc 
aaaattattg ggcaaaatca agaactcaat 
cctgaggggt cgttgaatac aacattgttt 
gagaaagaag tataccaagt cattctgtat 
ggaatcgccc tgcttgatgc tcaaggatat 
gaaccacatg gagttttaaa ttttgctctt 



cagattgcag aggagggatc aaccatttct 60 
gattatgtgc atgtttttta caccatttca 120 
gttgatgact ttgctaatgc cagtggaact 180 
gttctgaata tatatgttct tgatgatgat 240 
acattggttt ctgcaattcc tggagatggg 300 
agcatagatc ctgaaaagga aacgactgat 3 60 
ggcttgctgc agttctccac agggctgcct 420 
gcaagcagcg ttccacatat cactgtggag 4 80 
atccgtgcac agggacttct gggaagggtg 540 
gcattcagtc ctgaggatta ccagaatgtt 600 
agatataaat acattttcat aaacatcact 660 
tttaaagttg agttgttaaa cttggaagga 720 
ctagtgacaa ttgcagcctc tgaccacgct 780 
ctctttgtca gtggaactga accagaagat 840 
agacatcatg gaactctgtc tccagtgact 900 
ggtgatctcg ccttcacctc tggcaacatc 960 
atcactgtgg agatattgcc tgacgaagac 1020 
gtcctcagtg tttccagtgg ttctttggga 1080 
gctagtgatg atccatatgg gatattcatt 1140 
gaggaagcaa cccagaacat cacactatca 1200 
gtccttgtct catatgcaac actagatgat 1260 
ttagcgagag caactcaagg aagagactat 1320 
gctaatcaga gtgaggcaac aatagctatt 1380 
tccgaatctg tctttatcga actactcaac 1440 
tcaattccaa attctccacg tcttgggcct 1500 
attgccaatg atgatgcatt tggaactctt 1560 
gaaaatcatg ttggacccat tatcaatgtg 1620 
tctgtgaagt ttaaagctgt gccaataact 1680 
tcatcagatg tggtcttgct agaaggggaa 1740 
aatgatatct atcctgaact ggaagaatct 1800 
ggaggagcca gactaggggc tttaacagag 1860 
ccctatggat tatttggttt tcagattact 1920 
tcagtgaagg taaacctgcc aataattcga 1980 
cagtgggttg ccaccattaa tggacagctt 2 040 
aatgtgacct ttgcccctgg ggaaaccatt 2100 
gacgttccgg agattgaaga ggttatccaa 2160 
actattgggt tagatcgaat tgcaaatatt 2220 
acagtagcct ttgctcagat ggtttatcgt 2280 
gctaatataa ctgtcaggcg aagcggaggg 2340 
acttccgaca ttgatgtagt ggctctggca 2400 
tatgaatctc caattcaagg ggtgcctgac 2460 
gccgtggggg agcccctgta tacctgtgcc 2520 
gcgttttcat ttttcagtgc ttctgagggt 2580 
agcccagctg tcaacaattc agacttctgg 2640 
tctcttttta gtggtcaggc tgtggctggg 2700 
gccataatgc aggaaggtga tgaattcgca 2760 
ttcccagaga tggatgagag ttttctaatt 2820 
tcagccagtt tgaaaaatca gccaaccata 2880 
gcactaaatg gtgatgcctt tggagtgttt 2940 
gaagatggct tatttgttga agttcaggag 3000 
cacaggacag ggggcagctt aggtcaagtg 3060 
gctactgaag gtttagattt tataggtgct 3120 
accaaaaaga cagtcatttt aaccatcttg 3180 
atcatagtta gtttggtgta cactgaaggt 3240 
gttagagtga acattttggc caatgacaat 3300 
tccagatctg tcataggtca tgagggagaa 3360 
cctggtcgag gaaatgttac tgttaactgg 3420 
tttgctaact ttagcggaca acttttcttt 3480 
gtgcatttgt tggatgacaa cattcctgag 3540 
gatgtcagga cacaaggagt tccaccagcc 3600 
gcagctgtcc tcacagtaga agccagtgat 3660 
tcatcaagat ttgtgttact acaagaggct 3720 
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I aacataacaa ttcagctttt catcaacaga gaatttggat ctctaggagc tatcaatgtc 3780 

acatatacca cggttcctgg aatgctgagt ctgaagaacc aaacagtagg aaacctagca 3840 
gagccagaag ttgattttgt ccctatcatt ggctttctga ttttagaaga aggggaaaca 3900 
gcagcagcca tcaacattac cattcttgag gatgatgtac cagagctaga agaatatttc 3960 
ctggtgaatt taacttacgt tggacttacc atggctgctt caacttcatt tcctcccaga 4020 
ctagattcag aaggtttgac tgcacaagtt attattgatg ccaatgatgg ggcccgaggt 4080 
gtaattgaat ggcaacaaag caggtttgaa gtaaatgaaa cccatggaag tttaacattg 4140 
gtagcccaga ggagcagaga acctcttggc catgtttcct tatttgtgta tgctcagaat '4200 
ttggaagcac aagtggggct ggattatatc ttcaccccaa tgattcttca ttttgctgat 4260 
ggagaaaggt ataaaaatgt caatatcatg attcttgatg atgacattcc agaaggagat 4320 
gaaaaatttc agctgatttt aacaaatcct tctcctggac tagagctagg gaaaaataca 4380 
atagccttaa ttattgtcct tgctaatgat gacggccctg gagttctatc atttaacaac 4440 
agtgagcact ttttcctaag agagccaaca gctctctacg tccaggagag tgttgcagta 4500 
ttgtacattg ttcgggaacc tgcacaagga ttgtttggaa cagtgacagt tcagttcatt 4560 
gtgacagaag tgaattcctc aaatgaatct aaagatctga ctccttccaa aggctatatt 4620 
gttttagaag aaggtgttcg attcaaggcc ctacaaatat ctgccatatt agacacggaa 4680 
ccagaaatgg atgagtattt tgtttgcacc ttgtttaatc caactggagg tgctagacta 4740 
ggggtgcatg ttcaaaccct gataacagtt ttgcaaaacc aggccccttt ggggctattc 4800 
agtatctctg cagttgaaaa tagagccacc tccatagaca tcgaagaagc caataggacc 4860 
gtgtatttaa atgtatctcg aactaatggc attgatttgg ctgatttaaa tatagaaaat 4920 
ccgaaaactt gtgaggcctt taatattggt ttttctccct actttgtgat tactcatgaa 4980 
gaaagaaatg aagaaaagcc ttctcttaac agtgtgttta cattcacatc tggatttaaa 5040 
ttattcctgg tacaaacaat cattattctg gaaagttctc aagtaagata ttttacttca 5100 
gacagccaag attatttaat cattgcaagt caaagagatg attccgaatt aactcaggtc 5160 
ttcaggtgga atggaggaag cttcgtgttg catcaaaaac tccctgtccg aggtgtgctg 5220 
accgtggcct tgttcaacaa gggaggctct gtgttcttag ccatttccca ggctaatgcc 5280 
aggctaaact cccttttatt cagatggtct ggcagtgggt ttattaactt tcaagaggtg 5340 
cctgtcagtg ggacaacaga agttgaggct ttgtcttcag ccaatgatat ttacctaata 5400 
tttgccaaaa atgtctttct aggagatcag aattcaattg atattttcat ctgggagatg 5460 
ggacagtctt ccttcaggta ttttcagtct gtagattttg ctgctgttaa cagaatccac 5520 
tccttcacac cagcctcagg aatagcccac atacttctta ttggccaaga tatgtctgct 5580 
ctttactgct ggaattcgga gcgtaatcaa ttctcttttg ttctggaagt accttctgct 5640 
tatgatgtgg cttctgttac agtaaagtcc cttaattcaa gcaagaattt aatagctcta 5700 
gtgggagctc attcacatat atatgagcta gcctacattt ccagccattc tgactttatt 5760 
cctagttcag gtgaactgat atttgaacct ggtgagagag aagctacaat agcagtaaat 5820 
atccttgatg atacagttcc agaaaaagaa gaatccttca aagttcaact taaaaatccc 5880 
aaaggaggag cagagattgg cattaatgat tctgtaacaa taaccattct gtctaatgat 5940 
gatgcctatg gaattgttgc atttgctcag aattcattat ataagcaagt ggaagaaatg 6000 
gagcaagata gcctagtaac cttgaacgtt gaacgcttaa aaggaacata tggccgtata 6060 
accatagcat gggaagctga tggaagtatt agtgatatat ttcctacctc aggagtggtg 6120 
gaaaaacgaa tgtcagcaaa gattttattt actgaaggcc aggtactgtc aacaatcact 6180 
ctaactattc ttgctgataa tataccagag ttatcagagg ttgtgattgt aaccctcacc 6240 
cgtatcacca cagaaggggt tgaggactca tacaaaggtg ctactattga tcaggacaga 6300 
agcaagtctg ttataacaac tttgcccaat gactcacctt ttggcttggt gggctggcgt 6360 
gctgcgtctg tcttcattag agtagcagag cctaaagaaa acaccaccac tcttcagtta 6420 
caaatagctc gagataaagg actacttggg gatattgcca ttcacttgag agctcaaccc 6480 
aatttcttac tgcatgtcga taatcaagct actgagaatg aagattatgt attgcaagaa '6540 
acaataataa taatgaaaga aaacataaaa gaagctcatg ccgaagtttc cattttgccg 6600 
gatgaccttc ctgaattgga ggaaggattt attgtcacta tcactgaggt gaacctggtg 6660 
aactctgact tctctacagg acagccaagt gtgcggaggc ccggaatgga aatagctgag 6720 
ataatgatag aagaaaatga cgatcccaga ggaattttta tgtttcatgt tactagaggc 6780 
gctggggaag ttattactgc ctatgaggtg cctccaccct tgaacgttct tcaagttcct 6840 
gtagtccggc tggctggaag ctttggggca gtaaatgttt attggaaagc atcaccagac 6900 
agtgctggcc tggaagactt taaaccatct catgggattc ttgaatttgc agataaacag 6960 
gttactgcaa tgatagaaat caccataatt gatgatgctg aatttgaatt gacagagacg 7020 
ttcaatattt ccttgatcag tgttgctgga ggtggcagac ttggtgatga tgttgtggta 7080 
actgttgtta ttccacaaaa tgattctcca tttggagtat ttggatttga agaaaagact 7140 
gtaatgattg atgaatccct ttcatccgat gaccctgatt catatgtgac attgacggtt 7200 
gtccggtccc caggaggaaa aggaaccgtc cgacttgagt ggaccataga tgagaaggct 7260 
aaacataacc ttagtccttt gaatgggacc cttcattttg atgagactga gtcccagaag 7320 
accattgtgt tgcacacact tcaagacaca gtgttggagg aggacaggcg tttcaccatt 7380 
cagctgatat caattgatga ggtagaaata tctccagtaa aaggtagtgc atcaataatt 7440 
attcggggtg ataagcgagc atcaggagaa gttgggatag ctccgtcatc taggcacatc 7500 
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ctcattgggg aaccctcagc aaaatataat ggtaccgcta ttatcagcct tgttcgaggc 7560 
ccagggattt tgggggaggt cacagtgttc tggaggatat tccctccttc cgtgggggaa 7620 
tttgctgaaa catcaggaaa actgacaatg cgagacgaac agtctgcagt cattgtagta 7680 
atacaggctt tgaacgatga cattcccgag gaaaaaagct tctatgagtt tcagctcact 7740 
gcagtcagtg agggaggagt tctgagtgaa tccagcagca ctgccaacat cacggtggtg 7800 
gccagcgact ctccctatgg ccgatttgcc ttttcacatg agcaacttcg agtgtcagaa 7860 
gcacagaggg ttaacatcac aatcatccgt tccagtggag attttggcca tgtgcgactc 7920 
tggtacaaga cgatgagcgg gacagcggaa gcaggcttgg attttgttcc tgcagcaggg 7980 
gagctcctct ttgaagcagg ggagatgagg aaaagtctgc atgttgaaat ccttgatgat 8040 
gactatcctg aaggcccaga ggaattttct ctaacaatta caaaggtgga actccaggga 8100 
aggtaaagga gaaaggcaat taggaaaaag aaagcaaaga gcagagagaa agagagaaag 8160 
ttctattgtg tagagattaa tactgagctc taaagcagtt ctttccctac tctttctcag 8220 
atttgtgtga cattagaaca ggacatacaa gggaaattat gacaatgact gagttttttt 8280 
ttttaatttt gttaaatata acttccctct gaaaattgaa agtacgtgtt tctattgtgt 8340 
ggacacactc tgggtagcca ggaagttgaa tatgataacg gcttctaatt ggtgaagaca 8400 
cagcctaaca gctctctaaa tcagaggtca gctggtgtgg ctgaatttgc ttcttttgat 8460 
ttccatactt gtgaaagggt gaaggga 8487 

<210> 63 
<211> 3264 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 71726948CB1 

<400> 63 

cgctggaagc agctggaggc tccagttctc atctgctcag gtgtccccgg cgccttggcg 60 
aactcggcca ctccagttcc tcacgtggtg agcactcagg gcagcgggtc gattttccga 120 
ggtcccatac ctgggtttga ggggcgcggc tcgcagcggc gggtgcaggg gcgactgcca 180 
gccctcaccc cgcctcgggg tgcgttcgga ggccgacacc tggaggacgc ctccagtccc 240 
cgcgggacgc cacgcctgcg cgccagggat ccgggataag aagtgcgcgc cgggctccgg 3 00 
ctgcgcgccg cggggccacc agtttgcgcg cagggctcag gcgaccgtgc ggccatggac 3 60 
acgccacggg gcatcggcac cttcgtggtg tgggactacg tggtgttcgc gggcatgctg 420 
gtcatctcgg ccgccatcgg catctactac gccttcgctg ggggcggcca gcagacctcc 480 
aaggacttcc tgatgggcgg ccgcagaatg accgcagtgc ccgtggcgct gtccctcacc 540 
gctagcttca tgtcagccgt cactgtcctg ggcaccccct ccgaggtcta ccgttttggg 600 
gccattttta gcatctttgc cttcacctac ttctttgtgg tggtcatcag cgcggaggtc 660 
ttcctcccgg tgttctacaa actgggaatt accagcacct acgagtattt agaacttcga 720 
tttaacaaat gtgttcgtct ctgtggaaca gtcctcttca ttgttcaaac aattctgtat 780 
actggaattg ttatttatgc ccctgccctg gctttgaatc aagtcacagg atttgatctg 840 
tggggcgcgg tagtggcaac gggggtggtc tgcacattct actgcacact gggtggtctt 900 
aaagcagtta tctggacaga tgtttttcaa gttgggatca tggtggctgg atttgcatcc 960 
gtgattatac aggctgtggt gatgcaaggt ggaatcagca ctattttaaa tgatgcctat 1020 
gatggtggaa gattaaattt ctggaatttt aatcctaacc ctttgcaaag acacaccttc 1080 
tggacaatta ttataggagg gaccttcaca tggaccagca tctacggtgt caaccaatcc 1140 
caggtgcaga gatatatttc ttgtaaaagc agattccagg caaaactgtc tctctacatc 1200 
aatcttgtgg gactctgggc aatcctcaca tgctcagtgt tttgtgggct cgccctatat 1260 
tccaggtacc atgactgtga tccttggaca gccaagaaag tgtctgcacc agaccagctc 1320 
atgccttatt tggtactgga cattctgcaa gattatccag gacttcctgg actttttgtg 1380 
gcctgtgctt acagtgggac attaagcaca gtgtcctcca gtattaatgc cttagcagca 1440 
gtaactgtgg aagatctaat caaaccttac ttcagatcgc tctcagaaag gtctctgtct 1500 
tggatttccc aaggaatgag tgtggtgtat ggagccctgt gtattggaat ggctgcgctg 1560 
gcgtcactta tgggagcttt gttgcaggca gcactcagcg tatttggtat ggttggtgga 1620 
ccacttatgg gcctgttcgc tttgggcatt ttggttccct ttgccaactc aattggagca 1680 
cttgttggtc tgatggctgg atttgccatt tctctatggg ttggaattgg agctcaaata 1740 
tatcctccac ttcctgagag aacattgcca ttgcaccttg atatccaagg ctgtaacagc 1800 
acctacaatg agacaaattt gatgacaacc acagaaatgc catttactac tagtgttttt 1860 
caaatataca atgttcaaag gactccactg atggataact ggtattcttt atcatatctg 1920 
tacttcagca ctgttggaac tttggtaaca ttattagtgg ggatacttgt cagtttatca 1980 
acaggaggaa gaaaacagaa cttagacccc agatatatac taaccaaaga ggacttttta 2040 
tccaattttg atatttttaa gaaaaagaag catgttttga gctataaatc acatccagtg 2100 
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gaagatggtg 
ggcaagagca 
tttcaatttt 
gttgtatcat 
tttactacat 
tctctttcct 
aaggctaata 
gtattattga 
catcacaatt 
agcaatgagg 
agggcatata 
ctggctggga 
ttgtgccttt 
tagactgtaa 
cactctaact 
caagacactc 
attatgaact 
atgatccaag 
cttaatgcta 
tatgcctaga 



gaactgataa 
atgggactcg 
atatgttttc 
gagtgtttgg 
cattaattga 
ttatttcgat 
cacacatatg 
aaatattttc 
catcttgaaa 
taacaggtct 
ctaacctgca 
aagaccgtgg 
tggcttgttt 
tttttatctg 
gagcttgtct 
atgtgaatca 
cccattagct 
aatattatat 
ttaagttgaa 
aaaaaaaaaa 



tcctgctttc 
tttgtgaagc 
taagataatt 
gggataagtt 
tgttactctg 
gaagctataa 
cacatgtatt 
catgccttgg 
agagcttttt 
gtgttgtcta 
aactgattct 
gcccaatgat 
tctagagttt 
catgcccagt 
tcataatttt 
tgccacccca 
tggtactaac 
tcttcaaaat 
cttttgaatg 
aaaa 



aaccacattg 
tgctctgata 
ggatcaggtt 
tttgttaaaa 
gagtttagaa 
ttgtgaaaat 
tgattgtcaa 
tgctagcata 
tccctcctac 
gatctttgct 
gaatcaggaa 
caaagtcttc 
ctgggctttg 
ttctgaccta 
ctgtttattg 
aatcctggct 
atatacttga 
ggttaagctc 
tcaactaatt 



aattgaactc 
ctagatatcc 
ttctttgtgt 
caaagtctgg 
ttctggcatt 
tgtaactaca 
aggtatattc 
taagtttgga 
cacatacacc 
ttttatcccc 
ggtggtaatc 
ttggtgctgt 
gctgctgata 
tcaacttggg 
ccctgggctt 
tatcaagtcc 
tgtaggtatt 
catggagtta 
tgcaatcaat 



agatcagagt 
ttaaatgatg 
gtgtgtgtgt 
actatcttca 
gacatttccc 
tagatgctga 
ttaaatttgg 
agtttgccaa 
attcttaggg 
ctatcagtcc 
aataagtatt 
tcattaattc 
ctgcctttct 
ttttattgtg 
ggatatgtct 
cagactataa 
tatggacttg 
gatgactaca 
taaagataca 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3264 



<210> 64 

<211> 1659 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



7487393CB1 



<400> 64 

atgggctttg 

atagctttct 

actgcattca 

gacaatgata 

tcaaacctga 

ctgaacggga 

gtgtacgaca 

tctcagtcac 

ctaatatatg 

ctccagctgg 

atactgcgct 

agcttagagt 

tacagtgttg 

ttgcaactga 

gagtctgctc 

agagttgcac 

agatccacca 

ttccgtgcac 

actgtaccct 

ttccagattc 

aatcatatgg 

ctggtcaaca 

ggaattggta 

cccaccatat 

gcactggctc 

tatggagtct 

ctacctcttc 

aagcaggaag 



atgtgctcct 
tttgcatcac 
cccctagtca 
ccgggaccct 
ggccacagaa 
ccttccccaa 
gaagctcttt 
taaaatcaat 
gccatctttc 
ccatctctaa 
tcttggcagg 
ggacattgcc 
ggcagatgct 
ctgtgtctac 
ggtggctgat 
acataaatgg 
tgaagaagga 
ccaaattgcg 
tttatggcct 
tctgtggagc 
gtcgtcgaat 
cctttttgcc 
gtgtttctgc 
tgaggtcaac 
ctctgttgat 
tccccatcct 
ctaacaccat 
atacttgcat 



ggatcaagtg 
caacatccta 
tcgctgctgg 
cagcaaggat 
gtgtcagcgc 
cacaaatgag 
cctctccacc 
ggttcaatcc 
agacagggtt 
cacctgtgcg 
gttctccacc 
ccggtcacga 
cctaggaggg 
acccataatt 
tatcaacaat 
aaaaaagaat 
gttggatgca 
aatgagagtc 
gatactcaac 
tgtcacattc 
aagccagata 
ccaagaaatg 
tgctagcaac 
agttgcagga 
gaccttaatg 
tgctgtccct 
ccaggatgtg 
gaaagtaaca 



ggtggcatgg 
ctgttcccta 
gtccccctcc 
gacctcctga 
tttatccatc 
ccagacacgg 
atcgtgactg 
ctatttatgg 
ggacggaaga 
gccttcgctc 
atgactattt 
tctatgacaa 
ctggcttttg 
gtcctcttct 
cagctagatg 
actgaagaga 
gtccgaatta 
ttcggcctgt 
ttgcagcact 
acagccagat 
ttgttcacgt 
cagatcctgc 
agtgcttctg 
atcaatgcag 
gcgtattctc 
gttatcctcc 
gaaaatgaca 
cagttttaa 



ggagattcca 
atattgtgtt 
tggacaatga 
gaatctccat 
cccagtggca 
agccctgtgt 
agtgggacct 
ctgggtcact 
tcatatgcaa 
ccaccttcct 
tgggaaacac 
taatggtgct 
ccattcagga 
tgtcctcttg 
agggcttaaa 
cactgaccac 
aaacatccat 
gctttgtgag 
tagggagcaa 
gtgtttccct 
tcccggtggg 
gtgtggtttt 
tccaccacaa 
tgtccggtag 
cccacctacc 
tccttccaga 
gaaaagattc 



gatttgtctg 
ggagaacttc 
cactgtgtct 
cccactggac 
gctccttcac 
ggatggctgg 
ggtatgtgaa 
gctgggaggt 
attgtgtttc 
tgtttactgc 
ttttattctc 
attatgttcc 
ctggcacata 
gaagatggtg 
ggagcttaga 
tgagcttgtg 
tttttccctg 
attcgcaatc 
tgtctccctg 
tttgacactg 
acttttcatt 
agcaactttg 
cgagctcgtc 
gactggggca 
ctggatttcc 
aaccagggat 
aagaaacata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1659 
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